Predict Sex From Name Person - Text Mining - Lucene - C# [on hold]

I have million collection of name and sex of person. I get it from survey and others data. I want to make validation of sex based of that data. I mean, i want to make program validation to check : When user input :



  • name : Robert Pattinson, Sex : female --> Result false

  • name : Steven Seagle, Sex : male --> Result true

  • name : Angelia Jolie, Sex : male --> Result false

  • Etc ...


User may input the combination data, may be its not exists in my collection data. User also may input the wrong name, example Robert typed Robet or Rbert or the other miss typed...


Can i realize it ? Where i have to start? Actually, i usual make program with C# / .Net?


Thanx Before