You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A small library for extracting rich content from urls.
what does it do?
micawber supplies a few methods for retrieving rich metadata about a variety of
links, such as links to youtube videos. micawber also provides functions for
parsing blocks of text and html and replacing links to videos with rich embedded
content.
examples
here is a quick example:
importmicawber# load up rules for some default providers, such as youtube and flickrproviders=micawber.bootstrap_basic()
providers.request('https://www.youtube.com/watch?v=54XHDUOHuzU')
# returns the following dictionary:
{
'author_name': 'pascalbrax',
'author_url': u'https://www.youtube.com/user/pascalbrax''height': 344,
'html': u'<iframe width="459" height="344" src="https://www.youtube.com/embed/54XHDUOHuzU?fs=1&feature=oembed" frameborder="0" allowfullscreen></iframe>',
'provider_name': 'YouTube',
'provider_url': 'https://www.youtube.com/',
'title': 'Future Crew - Second Reality demo - HD',
'type': u'video',
'thumbnail_height': 360,
'thumbnail_url': u'https://i2.ytimg.com/vi/54XHDUOHuzU/hqdefault.jpg',
'thumbnail_width': 480,
'url': 'https://www.youtube.com/watch?v=54XHDUOHuzU',
'width': 459,
'version': '1.0',
}
providers.parse_text('this is a test:\nhttps://www.youtube.com/watch?v=54XHDUOHuzU')
# returns the following string:thisisatest:
<iframewidth="459"height="344"src="https://www.youtube.com/embed/54XHDUOHuzU?fs=1&feature=oembed"frameborder="0"allowfullscreen></iframe>providers.parse_html('<p>https://www.youtube.com/watch?v=54XHDUOHuzU</p>')
# returns the following html:<p><iframewidth="459"height="344"src="https://www.youtube.com/embed/54XHDUOHuzU?fs=1&feature=oembed"frameborder="0"allowfullscreen="allowfullscreen"></iframe></p>
About
a small library for extracting rich content from urls