WebClientProgramming - PythonInfo Wiki
Client-Side Web Programming
Libraries
utidylib and mxTidy -- Python interfaces to the HTML Tidy library, which cleans up HTML documents.
BeautifulSoup -- a permissive HTML parser.
Don't use HTMLParser on HTML that might be invalid! That way lies pain. Either clean it up first (using tidy), or use a different parser.
ClientCookie, ClientForm, and Mechanize -- higher-level libraries for writing a web client.
mechanoid -- a fork of mechanize.
libxml2dom -- parses HTML using libxml2's liberal HTML parser.
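
One job the higher-level client libraries above take on is cookie handling across requests. The following is only a minimal sketch of that idea using the standard library's http.cookiejar and urllib.request as a stand-in; it does not use ClientCookie or Mechanize themselves. The function name make_cookie_client and the User-Agent string are hypothetical, chosen for illustration.

```python
from http.cookiejar import CookieJar
from urllib.request import HTTPCookieProcessor, build_opener


def make_cookie_client(user_agent="example-client/0.1"):
    """Return (opener, jar): an opener that stores and resends cookies.

    The name and the default User-Agent are illustrative assumptions.
    """
    jar = CookieJar()
    # HTTPCookieProcessor records Set-Cookie headers from responses in the
    # jar and adds matching Cookie headers to later requests.
    opener = build_opener(HTTPCookieProcessor(jar))
    opener.addheaders = [("User-Agent", user_agent)]
    return opener, jar


opener, jar = make_cookie_client()
# No request is made here. Calling opener.open(url) would fetch the page,
# and any cookies the server sets would then appear when iterating over jar.
```

Reusing the same opener for every request is what lets a session cookie set by one page be sent along with the next.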
Resources
Grab a document from the web - from the Python Cookbook
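
Grabbing a document can be sketched with the standard library alone. This is not the Cookbook recipe itself, just a minimal illustration under stated assumptions: the helper name fetch_text, the 10-second timeout, and the UTF-8 fallback are all hypothetical choices. The usage line reads from a data: URL so that it runs without network access; an http or https URL is passed the same way.

```python
from urllib.request import urlopen


def fetch_text(url, timeout=10.0, default_charset="utf-8"):
    """Fetch a URL and return its body decoded to str.

    The timeout and the fallback charset are illustrative assumptions.
    """
    with urlopen(url, timeout=timeout) as response:
        raw = response.read()
        # Use the charset declared in the Content-Type header if there is one.
        charset = response.headers.get_content_charset() or default_charset
    # Replace undecodable bytes rather than failing on a mislabeled page.
    return raw.decode(charset, errors="replace")


# A data: URL keeps the example self-contained (no network needed).
text = fetch_text("data:text/plain;charset=utf-8,hello%20web")
```

The returned string can then be handed to a permissive parser, or cleaned up with tidy first, as recommended in the Libraries list.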
(last edited 2007-06-26 16:45:10 by PaulBoddie)