| CARVIEW |
Select Language
HTTP/1.1 200 OK
Connection: keep-alive
Server: nginx/1.24.0 (Ubuntu)
Content-Type: text/html; charset=utf-8
Cache-Control: public, max-age=300
Content-Encoding: gzip
Via: 1.1 varnish, 1.1 varnish
Accept-Ranges: bytes
Age: 0
Date: Sun, 18 Jan 2026 20:03:13 GMT
X-Served-By: cache-dfw-kdal2120030-DFW, cache-bom-vanm7210066-BOM
X-Cache: MISS, MISS
X-Cache-Hits: 0, 0
X-Timer: S1768766594.572392,VS0,VE298
Vary: Accept, Accept-Encoding
transfer-encoding: chunked
scrapbook: collect posts of site that is wrote in config yaml using feed or scraping
[Skip to Readme]
scrapbook: collect posts of site that is wrote in config yaml using feed or scraping
Please see the README on GitHub at https://github.com/matsubara0507/scrapbook#readme
[Skip to Readme]
Modules
- ScrapBook
- ScrapBook.Cmd
- ScrapBook.Cmd.Options
- ScrapBook.Cmd.Run
- ScrapBook.Cmd
Downloads
- scrapbook-0.5.0.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
For package maintainers and hackage trustees
Candidates
- No Candidates
| Versions [RSS] | 0.3.2, 0.3.3, 0.5.0 |
|---|---|
| Change log | CHANGELOG.md |
| Dependencies | base (>=4.7 && <5), drinkery, extensible (>=0.5), githash, rio (>=0.1.5), scrapbook, scrapbook-core (>=0.5), yaml [details] |
| License | MIT |
| Copyright | 2018 MATSUBARA Nobutada |
| Author | MATSUBARA Nobutada |
| Maintainer | MATSUBARA Nobutada |
| Uploaded | by matsubara0507 at 2020-12-05T07:40:37Z |
| Category | Web |
| Home page | https://github.com/matsubara0507/scrapbook#readme |
| Bug tracker | https://github.com/matsubara0507/scrapbook/issues |
| Source repo | head: git clone https://github.com/matsubara0507/scrapbook |
| Distributions | |
| Executables | scrapbook |
| Downloads | 1218 total (12 in the last 30 days) |
| Rating | (no votes yet) [estimated by Bayesian average] |
| Your Rating |
|
| Status | Docs not available [build log] All reported builds failed as of 2020-12-05 [all 3 reports] |
Readme for scrapbook-0.5.0
[back to package description]scrapbook
This is cli tool that collect posts of site that is wrote in config yaml using feed or scraping.
Usage
- clone this repository or add
scrapbookpackage toextra-depsinstack.yaml - run
stack install
e.g.
$ stack exec -- scrapbook -o "example" example/sites.yaml
Docker
$ docker run --rm -v `pwd`/example:/work matsubara0507/scrapbook scrapbook sites.yaml
build docker image:
$ stack --docker build -j 1 Cabal # if out of memory in docker
$ stack --docker --local-bin-path=./bin install
$ docker build -t matsubara0507/scrapbook . --build-arg local_bin_path=./bin
Command
scrapbook [options] [input-file]
-o DIR --output=DIR Write output to DIR instead of stdout.
-t FORMAT, -w FORMAT --to=FORMAT, --write=FORMAT Specify output format. default is `feed`.
--version Show version
GHCi
>> import Control.Lens ((^.))
>> import Data.Maybe
>> conf <- fromJust <$> readConfig "example/sites.yaml"
>> (Right posts) <- collect . fmap concat $ mapM (fetch . toSite) (conf ^. #sites)
>> collect $ writeFeed "example" (fromJust $ conf ^. #feed) posts
Right ()
Example
see matsuara0507/scrapbook-example
Documentation
How to write config yaml file.
# configuration for generating Atom feed (Optional)
feed:
## write as site title to Atom feed
title: "Sample Site Posts"
## write as site url to Atom feed
baseUrl: "https://example.com"
## file name (Optional)
### if nothing, use same name from input file
name: atom.xml
# Haskeller's site configuration
sites:
## Title of site
- title: "ひげメモ"
## Author of site
author: matsubara0507
## URL of site
url: https://matsubara0507.github.io
## Feed url of site
### there are several field to set feed url
### `feed` is basic field. This field auto branch to Atom or RSS 2.0.
feed: https://matsubara0507.github.io/feed
- title: "Kuro's Blog"
author: "Hiroyuki Kurokawa"
url: https://kurokawh.blogspot.com/
### `atom` is for Atom feed.
atom:
### feed url of Atom
url: https://kurokawh.blogspot.com/feeds/posts/default
### set attr as constraint for link on each entry of Atom feed (Optional)
### if nothing, choice head. if set multiple attr, conjunction.
linkAttrs:
rel: alternate
- title: "あどけない話"
author: "kazu-yamamoto"
url: https://d.hatena.ne.jp/kazu-yamamoto
### `rss` is for RSS 2.0 feed.
### set feed url.
rss: https://d.hatena.ne.jp/kazu-yamamoto/rss2