PC Review


Reply
Thread Tools Rate Thread

convert Microsoft Word Document into HTML

 
 
=?Utf-8?B?S3VtYXIuQS5QLlA=?=
Guest
Posts: n/a
 
      8th Oct 2007

I want to convert Microsoft Word Document into HTML format using .NET (c#).
Do anybody have any idea how to do this.

--
Kumar.A.P.P
Software Engineer, Infosys Technologies Limited.
http://geekswithblog.squarespace.com
 
Reply With Quote
 
 
 
 
=?Utf-8?B?UGV0ZXIgQnJvbWJlcmcgW0MjIE1WUF0=?=
Guest
Posts: n/a
 
      8th Oct 2007
There are only two ways I know of to do this.

1) Use the Word object model and "Save as HTML" on the document object.
2) Use a third party component engineered to work with Word documents.

The Word Document (except for the most recent version which is actually
well-formed XML) is a prprietary binary format. Load a Word document into
Notepad and you'll see.
-- Peter
Recursion: see Recursion
site: http://www.eggheadcafe.com
unBlog: http://petesbloggerama.blogspot.com
BlogMetaFinder: http://www.blogmetafinder.com



"Kumar.A.P.P" wrote:

>
> I want to convert Microsoft Word Document into HTML format using .NET (c#).
> Do anybody have any idea how to do this.
>
> --
> Kumar.A.P.P
> Software Engineer, Infosys Technologies Limited.
> http://geekswithblog.squarespace.com

 
Reply With Quote
 
Mark Rae [MVP]
Guest
Posts: n/a
 
      8th Oct 2007
"Kumar.A.P.P" <(E-Mail Removed)> wrote in message
news:00FE0615-7587-400A-A020-(E-Mail Removed)...

> I want to convert Microsoft Word Document into HTML format using .NET
> (c#).
> Do anybody have any idea how to do this.



http://www.aspose.com/Products/Aspos...s/Default.aspx


--
Mark Rae
ASP.NET MVP
http://www.markrae.net

 
Reply With Quote
 
Mark Rae [MVP]
Guest
Posts: n/a
 
      8th Oct 2007
"Peter Bromberg [C# MVP]" <(E-Mail Removed)> wrote
in message news:E9035A3A-3D9B-495A-B6D0-(E-Mail Removed)...

> 1) Use the Word object model and "Save as HTML" on the document object.


Since the user has posted on an ASP.NET newsgroup, I guess it's safe to
assume that he's trying to do this in ASP.NET, which obviously means that he
cannot use the Word object model since server-side Office automation doesn't
work:
http://support.microsoft.com/default...US;q257757#kb2


--
Mark Rae
ASP.NET MVP
http://www.markrae.net

 
Reply With Quote
 
John Lam
Guest
Posts: n/a
 
      3rd May 2011
You can use .NET DLL from ABCPDF to convert PDF to html.

> On Monday, October 08, 2007 7:25 AM KumarAP wrote:


> I want to convert Microsoft Word Document into HTML format using .NET (c#).
> Do anybody have any idea how to do this.
>
> --
> Kumar.A.P.P
> Software Engineer, Infosys Technologies Limited.
> http://geekswithblog.squarespace.com



>> On Monday, October 08, 2007 7:38 AM pbromber wrote:


>> There are only two ways I know of to do this.
>>
>> 1) Use the Word object model and "Save as HTML" on the document object.
>> 2) Use a third party component engineered to work with Word documents.
>>
>> The Word Document (except for the most recent version which is actually
>> well-formed XML) is a prprietary binary format. Load a Word document into
>> Notepad and you'll see.
>> -- Peter
>> Recursion: see Recursion
>> site: http://www.eggheadcafe.com
>> unBlog: http://petesbloggerama.blogspot.com
>> BlogMetaFinder: http://www.blogmetafinder.com
>>
>>
>>
>> "Kumar.A.P.P" wrote:



>>> On Monday, October 08, 2007 7:40 AM Mark Rae [MVP] wrote:


>>> http://www.aspose.com/Products/Aspos...s/Default.aspx
>>>
>>>
>>> --
>>> Mark Rae
>>> ASP.NET MVP
>>> http://www.markrae.net



>>>> On Monday, October 08, 2007 7:48 AM Mark Rae [MVP] wrote:


>>>> "Peter Bromberg [C# MVP]" <(E-Mail Removed)> wrote
>>>> in message news:E9035A3A-3D9B-495A-B6D0-(E-Mail Removed)...
>>>>
>>>>
>>>> Since the user has posted on an ASP.NET newsgroup, I guess it's safe to
>>>> assume that he's trying to do this in ASP.NET, which obviously means that he
>>>> cannot use the Word object model since server-side Office automation doesn't
>>>> work:
>>>> http://support.microsoft.com/default...US;q257757#kb2
>>>>
>>>>
>>>> --
>>>> Mark Rae
>>>> ASP.NET MVP
>>>> http://www.markrae.net




 
Reply With Quote
 
 
 
Reply

Thread Tools
Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
How do I convert a HTML document into a MS word document. =?Utf-8?B?c2hlcnJ5IEM=?= Microsoft Word Document Management 1 16th Mar 2006 09:07 PM
Convert MS Word Document in HTML Nikolay Petrov Microsoft Dot NET Framework 3 29th Nov 2005 08:20 PM
Convert MS Word Document in HTML Nikolay Petrov Microsoft Dot NET 3 29th Nov 2005 08:20 PM
Convert MS Word Document in HTML Nikolay Petrov Microsoft VB .NET 3 29th Nov 2005 08:20 PM
Convert Word Document to HTML =?Utf-8?B?RG92IE9zaHJp?= Microsoft Word Document Management 1 4th Jun 2005 04:02 AM


Features
 

Advertising
 

Newsgroups
 


All times are GMT +1. The time now is 08:04 AM.