%PDF- %PDF-
Direktori : /usr/lib/python2.7/site-packages/beautifulsoup4-4.6.3-py2.7.egg-info/ |
Current File : //usr/lib/python2.7/site-packages/beautifulsoup4-4.6.3-py2.7.egg-info/PKG-INFO |
Metadata-Version: 2.1 Name: beautifulsoup4 Version: 4.6.3 Summary: Screen-scraping library Home-page: http://www.crummy.com/software/BeautifulSoup/bs4/ Author: Leonard Richardson Author-email: leonardr@segfault.org License: MIT Download-URL: http://www.crummy.com/software/BeautifulSoup/bs4/download/ Description: Beautiful Soup is a library that makes it easy to scrape information from web pages. It sits atop an HTML or XML parser, providing Pythonic idioms for iterating, searching, and modifying the parse tree. # Quick start ``` >>> from bs4 import BeautifulSoup >>> soup = BeautifulSoup("<p>Some<b>bad<i>HTML") >>> print soup.prettify() <html> <body> <p> Some <b> bad <i> HTML </i> </b> </p> </body> </html> >>> soup.find(text="bad") u'bad' >>> soup.i <i>HTML</i> >>> soup = BeautifulSoup("<tag1>Some<tag2/>bad<tag3>XML", "xml") >>> print soup.prettify() <?xml version="1.0" encoding="utf-8"> <tag1> Some <tag2 /> bad <tag3> XML </tag3> </tag1> ``` To go beyond the basics, [comprehensive documentation is available](http://www.crummy.com/software/BeautifulSoup/bs4/doc/). # Links * [Homepage](http://www.crummy.com/software/BeautifulSoup/bs4/) * [Documentation](http://www.crummy.com/software/BeautifulSoup/bs4/doc/) * [Discussion group](http://groups.google.com/group/beautifulsoup/) * [Development](https://code.launchpad.net/beautifulsoup/) * [Bug tracker](https://bugs.launchpad.net/beautifulsoup/) * [Complete changelog](https://bazaar.launchpad.net/~leonardr/beautifulsoup/bs4/view/head:/NEWS.txt) # Building the documentation The bs4/doc/ directory contains full documentation in Sphinx format. Run `make html` in that directory to create HTML documentation. # Running the unit tests Beautiful Soup supports unit test discovery from the project root directory: ``` $ nosetests ``` ``` $ python -m unittest discover -s bs4 # Python 2.7 and up ``` If you checked out the source tree, you should see a script in the home directory called test-all-versions. This script will run the unit tests under Python 2.7, then create a temporary Python 3 conversion of the source and run the unit tests again under Python 3. Platform: UNKNOWN Classifier: Development Status :: 5 - Production/Stable Classifier: Intended Audience :: Developers Classifier: License :: OSI Approved :: MIT License Classifier: Programming Language :: Python Classifier: Programming Language :: Python :: 2.7 Classifier: Programming Language :: Python :: 3 Classifier: Topic :: Text Processing :: Markup :: HTML Classifier: Topic :: Text Processing :: Markup :: XML Classifier: Topic :: Text Processing :: Markup :: SGML Classifier: Topic :: Software Development :: Libraries :: Python Modules Description-Content-Type: text/markdown Provides-Extra: lxml Provides-Extra: html5lib