User:Thunderkatz/python

refpunct.py: This is a python script that fixes punctuation and spacing around references. Requires files in pywikipedia. Outputs result to command line, after which user can copy and past over old article text. Run with "python refpunct.py name of article" (can be redirected to output file with "python refpunct.py name of article >outfile" - though this might not work because of unicode/ascii conversion problems).

Notes: Program is GIGO: anything other than the correctly spelled, case sensitive name (except for the first character) of a non-redirect page will produce an error. Program is very possibly inefficient and very possible can't handle some cases I haven't thought of.

htmlremover.py:This is a python script that removes extraneous html from google discussion pages (e.g. the leaked ZB, Fab Five, etc.). Running is done through "python htmlremover.py infile >outfile"

I release this code with no copyright (a.k.a. feel free to use, modify, burn, etc.), and no express or implied guarantee of any kind.