|
|||||||||||||||||||||||||||||||||
|
|||||||||||||||||||||||||||||||||
Stand-alone toolsHere are some tools that may be helpful in converting Microsoft Word files to wiki markup: Two-stage conversion from Word to MediaWikiThe following methods both perform: Quick and Dirty
Automated scriptsThe conversion can also be done using a combination of two scripts and two software packages.
> doc2mw my_word.doc
#!/bin/bash
# doc2mw - Word to MediaWiki converter
FILE=$1
TMP="$$-${FILE}"
if [ -x "./html2mw" ]; then
HTML2MW='./html2mw'
else
HTML2MW='html2mw'
fi
wvHtml --targetdir=/tmp "${FILE}" "${TMP}"
# but see also AbiWord: http://www.abisource.com/help/en-US/howto/howtoexporthtml.html
# Remove extra divs
perl -pi -e "s/\<div[^\>]+.\>//gi;" "/tmp/${TMP}"
${HTML2MW} "/tmp/${TMP}"
rm "/tmp/${TMP}"
html2mw: a perl script called by doc2mw, which uses HTML::WikiConverter to convert html -> mediawiki.
#!/usr/bin/perl
# html2mw - HTML to MediaWiki converter
use HTML::WikiConverter;
my $b;
while (<>) { $b .= $_; }
my $w = new HTML::WikiConverter( dialect => 'MediaWiki' );
my $p = $w->html2wiki($b);
# Substitutions to get rid of nasty things we don't need
$p =~ s/<br \/>//g;
$p =~ s/\ \;//g;
print $p;
Disclaimer: These scripts are probably not the best way to do this, only a possible way to do this. Please feel free to improve them. OpenOffice 2.3OpenOffice, version 2.3 can save Word documents directly to MediaWiki format. This is an excellent alternative, since OpenOffice is a free, open-source replacement for Microsoft Word.
|
| All Right Reserved © 2007, Designed by Stylish Blog. |