Hello

  • Thread starter Thread starter Pohihihi
  • Start date Start date
P

Pohihihi

Hello NG,

I am currently volunteering for a local non profit organization, making few
software to run on donated computers. Few years back someone build an
application for them that saves filled forms as files in HTML. I guess at
that time that was a free way out rather using MS-Word or some other
application to make entry forms.

Anyways, problem is that they have thousands of HTML files and now they want
to change all to RTF (not doc). Good thing is that those files are using
lots simple html tags (<P>, <BR>, &nbsp; etc). No fancy stuff or CSS etc.

I know there are tones of commercial components that solves my problem but I
am looking in the direction of some thing in dotNet or older COM components
that might do some magic like that.

I have found a way to rip HTML off that file using AxBrowser (not a great
way as formatting gets messed, and yes I am avoiding Regex) but I am looking
for some thing that changes that html file to RTF directly. Any code sample,
site, or book might be a good help.

Thanks for the help.,

Po
 
Po,

Can they afford a license for Word? All you need is one copy really.
You can load the document into an instance of word, and then save it as RTF.

Yes, you would have to buy a single instance of word, but it would do
the trick.

You might even be able to find an older version of word that they have
somewhere on one of their machines, and use that license. If the HTML is
really that simple, it should be ok.

Hope this helps.
 
Thanks Nicholas,

No they don't have lic for software and saving money is core here. I guess I
will donate them my MS Word, that might just help them.

Thanks anyways,
Po


Nicholas Paldino said:
Po,

Can they afford a license for Word? All you need is one copy really.
You can load the document into an instance of word, and then save it as
RTF.

Yes, you would have to buy a single instance of word, but it would do
the trick.

You might even be able to find an older version of word that they have
somewhere on one of their machines, and use that license. If the HTML is
really that simple, it should be ok.

Hope this helps.


--
- Nicholas Paldino [.NET/C# MVP]
- (e-mail address removed)


Pohihihi said:
Hello NG,

I am currently volunteering for a local non profit organization, making
few software to run on donated computers. Few years back someone build an
application for them that saves filled forms as files in HTML. I guess at
that time that was a free way out rather using MS-Word or some other
application to make entry forms.

Anyways, problem is that they have thousands of HTML files and now they
want to change all to RTF (not doc). Good thing is that those files are
using lots simple html tags (<P>, <BR>, &nbsp; etc). No fancy stuff or
CSS etc.

I know there are tones of commercial components that solves my problem
but I am looking in the direction of some thing in dotNet or older COM
components that might do some magic like that.

I have found a way to rip HTML off that file using AxBrowser (not a great
way as formatting gets messed, and yes I am avoiding Regex) but I am
looking for some thing that changes that html file to RTF directly. Any
code sample, site, or book might be a good help.

Thanks for the help.,

Po
 
Back
Top