PythonCdTools - PythonInfo Wiki
You said you wanted to mirror the Python wiki on the CD; here is a little script to suck the pages from the wiki into a folder:
import socket, os, sys, urllib2
socket.setdefaulttimeout(15)
from time import sleep

def suckwiki(pagelist,                # url to plain text list of wiki pages
             rawpage,                 # url to raw wiki text of a page
             foldername="wikifiles",  # name of folder to save files to
             sleeptime=1              # seconds to sleep between page accesses
             ):
    foldername = os.path.join(os.path.abspath(os.path.dirname(sys.argv[0])), foldername)
    if not os.path.exists(foldername):
        os.mkdir(foldername)
    opener = urllib2.build_opener()
    listrequest = urllib2.Request(pagelist)
    listresponse = opener.open(listrequest)
    sleep(sleeptime)
    for pagename in listresponse:
        pagename = pagename.strip()
        pagename = pagename.replace('_', '_5f')
        pagename = pagename.replace(' ', '_20')
        print pagename
        fullpagename = rawpage % {'pagename': pagename}
        pagerequest = urllib2.Request(fullpagename)
        page = opener.open(pagerequest)
        f = open(os.path.join(foldername, pagename), "wb")
        f.write(page.read())
        f.close()
        page.close()
        sleep(sleeptime)

if __name__ == '__main__':
    pagelist = "https://www.python.org/cgi-bin/moinmoin/TitleIndex?action=titleindex"
    rawpage = r"https://www.python.org/cgi-bin/moinmoin/%(pagename)s?action=raw"
    foldername = "pythonwiki"  # name of folder to save pages to
    suckwiki(pagelist, rawpage, foldername)
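
The script above is Python 2 (it uses urllib2 and the print statement). For anyone running Python 3, a rough sketch of the same loop might look like the following. It is not the original author's code: the names quote_pagename and fetch_url and the fetch parameter are introduced here purely for illustration, and decoding the title index as UTF-8 is an assumption about the server.

```python
import os
import sys
import time
import urllib.request


def quote_pagename(pagename):
    """Escape a page title the same way the original script does.

    Underscores are escaped first, so the '_20' produced for spaces
    is not escaped a second time.
    """
    return pagename.replace("_", "_5f").replace(" ", "_20")


def fetch_url(url, timeout=15):
    """Return the body of url as bytes (15 second timeout, as in the original)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def suckwiki(pagelist, rawpage, foldername="wikifiles", sleeptime=1,
             fetch=fetch_url):
    """Save the raw text of every page listed at pagelist into foldername.

    fetch is a hypothetical hook (not in the original) so that the
    download step can be swapped out, for example when testing offline.
    Returns the list of escaped page names that were saved.
    """
    # As in the original, a relative folder name is resolved next to the script.
    base = os.path.abspath(os.path.dirname(sys.argv[0]))
    foldername = os.path.join(base, foldername)
    os.makedirs(foldername, exist_ok=True)

    # Assumption: the title index is UTF-8 text, one page title per line.
    titles = fetch(pagelist).decode("utf-8").splitlines()
    time.sleep(sleeptime)

    saved = []
    for title in titles:
        title = title.strip()
        if not title:
            continue  # skip blank lines in the title index
        pagename = quote_pagename(title)
        print(pagename)
        data = fetch(rawpage % {"pagename": pagename})
        with open(os.path.join(foldername, pagename), "wb") as f:
            f.write(data)
        saved.append(pagename)
        time.sleep(sleeptime)
    return saved
```

Calling suckwiki(pagelist, rawpage, "pythonwiki") with the two URLs from the script above would mirror its behaviour. Like the original, this sketch does nothing special for page titles containing "/", so saving such pages will fail unless the matching subfolder already exists.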
Thanks! -- ThomasWaldmann 2004-06-22 05:23:14
(last edited 2004-06-22 05:23:14 by twgate)