CARVIEW |
Select Language
HTTP/2 302
server: nginx
date: Fri, 22 Aug 2025 03:14:28 GMT
content-type: text/plain; charset=utf-8
content-length: 0
x-archive-redirect-reason: found capture at 20070804051944
location: https://web.archive.org/web/20070804051944/https://wiki.python.org/moin/PythonCdTools
server-timing: captures_list;dur=0.650784, exclusion.robots;dur=0.022965, exclusion.robots.policy;dur=0.010157, esindex;dur=0.012381, cdx.remote;dur=31.405628, LoadShardBlock;dur=128.736360, PetaboxLoader3.datanode;dur=72.858861
x-app-server: wwwb-app212
x-ts: 302
x-tr: 184
server-timing: TR;dur=0,Tw;dur=0,Tc;dur=0
set-cookie: wb-p-SERVER=wwwb-app212; path=/
x-location: All
x-rl: 0
x-na: 0
x-page-cache: MISS
server-timing: MISS
x-nid: DigitalOcean
referrer-policy: no-referrer-when-downgrade
permissions-policy: interest-cohort=()
HTTP/2 200
server: nginx
date: Fri, 22 Aug 2025 03:14:29 GMT
content-type: text/html;charset=utf-8
x-archive-orig-date: Sat, 04 Aug 2007 05:19:43 GMT
x-archive-orig-server: Apache/2.0.54 (Debian GNU/Linux) mod_fastcgi/2.4.2
x-archive-orig-connection: close
x-archive-guessed-content-type: text/html
x-archive-guessed-charset: utf-8
memento-datetime: Sat, 04 Aug 2007 05:19:44 GMT
link: ; rel="original", ; rel="timemap"; type="application/link-format", ; rel="timegate", ; rel="first memento"; datetime="Tue, 08 Nov 2005 06:37:25 GMT", ; rel="prev memento"; datetime="Sat, 30 Dec 2006 09:27:58 GMT", ; rel="memento"; datetime="Sat, 04 Aug 2007 05:19:44 GMT", ; rel="next memento"; datetime="Tue, 14 Aug 2007 14:15:40 GMT", ; rel="last memento"; datetime="Sat, 13 Jul 2024 23:05:34 GMT"
content-security-policy: default-src 'self' 'unsafe-eval' 'unsafe-inline' data: blob: archive.org web.archive.org web-static.archive.org wayback-api.archive.org athena.archive.org analytics.archive.org pragma.archivelab.org wwwb-events.archive.org
x-archive-src: 44_0_20070804045939_crawl105-c/44_0_20070804051820_crawl107.arc.gz
server-timing: captures_list;dur=0.478432, exclusion.robots;dur=0.015823, exclusion.robots.policy;dur=0.007438, esindex;dur=0.009432, cdx.remote;dur=4.509913, LoadShardBlock;dur=166.025184, PetaboxLoader3.datanode;dur=91.068180, PetaboxLoader3.resolve;dur=152.222401, load_resource;dur=104.424525
x-app-server: wwwb-app212
x-ts: 200
x-tr: 321
server-timing: TR;dur=0,Tw;dur=0,Tc;dur=0
x-location: All
x-rl: 0
x-na: 0
x-page-cache: MISS
server-timing: MISS
x-nid: DigitalOcean
referrer-policy: no-referrer-when-downgrade
permissions-policy: interest-cohort=()
content-encoding: gzip
PythonCdTools - PythonInfo Wiki
PythonCdTools
Python Wiki
You said you wanted to mirror the Python wiki on the CD, here is a little script to suck the pages from the wiki to a folder:
1 import socket, os, sys, urllib2
2 socket.setdefaulttimeout(15)
3 from time import sleep
4
5 def suckwiki(pagelist, #url to plain text list of wiki pages
6 rawpage, #url to raw wiki text of a page
7 foldername="wikifiles", #name of folder to save files to
8 sleeptime=1 #seconds to sleep between page accesses
9 ):
10 foldername = os.path.join(os.path.abspath(os.path.dirname(sys.argv[0])), foldername)
11 if not os.path.exists(foldername): os.mkdir(foldername)
12 opener = urllib2.build_opener()
13 listrequest = urllib2.Request(pagelist)
14 listresponse = opener.open(listrequest)
15 sleep(sleeptime)
16 for pagename in listresponse:
17 pagename = pagename.strip()
18 pagename = pagename.replace('_','_5f')
19 pagename = pagename.replace(' ','_20')
20 print pagename
21 fullpagename = rawpage % {'pagename':pagename}
22 pagerequest = urllib2.Request(fullpagename)
23 page = opener.open(pagerequest)
24 f = open(os.path.join(foldername,pagename),"wb")
25 f.write(page.read())
26 f.close()
27 page.close()
28 sleep(sleeptime)
29
30 if __name__ == '__main__':
31 pagelist = "https://www.python.org/cgi-bin/moinmoin/TitleIndex?action=titleindex"
32 rawpage = r"https://www.python.org/cgi-bin/moinmoin/%(pagename)s?action=raw"
33 foldername = "pythonwiki" #name of folder to save pages to
34 suckwiki(pagelist,rawpage,foldername)
Thanks! -- ThomasWaldmann 2004-06-22 05:23:14
EditText (last edited 2004-06-22 05:23:14 by twgate)
DeleteCache (cached 2007-08-03 02:10:18)- Login
- Navigation
- Actions
- Your recent pages