c# - Extract Specified Information From Microsoft Word -
suppose have word document contents below
name: blabla1 address: blablabla2 history employment: blablabla3 blablabla4 blablabla5 and questions here how , algorithm (if exist) extract information class in c#.
class employee { public string name { get; set; } public string address { get; set; } public ienumerable<string> history = new ilist<string>(); } any suggestion ?
var employee = extractmethodfromdoc(docpath);
ms word totally scriptable, via ole automation.
there no generic libraries in c# purpose. can use third-party app convert format, such ascii.
you can refer link perl module purpose:
http://www.xav.com/perl/site/lib/win32/ole.html
you can refer msdn:
http://msdn.microsoft.com/en-us/library/cc974107%28office.12%29.aspx
you can use open xml sdk 2.0 code link below:
http://www.microsoft.com/en-us/download/details.aspx?id=17985
Comments
Post a Comment