What Is Web Scraping
- It’s Use For Get Data From web pages
- web scraping also used for convert unstructured Data to Structured Data
- types of data :-
collapsed:: true
- Structured Data - Data base was in table format , or directly format
logseq.order-list-type:: number
- Unstructured Data - audio , Images , PDFs , LInks etc…
logseq.order-list-type:: number
- Types of Web Scrapers : -
collapsed:: true
- Self-built - make own
logseq.order-list-type:: number
- Pre-built -
logseq.order-list-type:: number
- browser extensions -
logseq.order-list-type:: number
- Software -
logseq.order-list-type:: number
- Cloud -
logseq.order-list-type:: number
- Rules: -
- it’s required permission otherwise it’s illegal or
- you can go to web or write {/robots.txt} ⇒ it’s shows permissions
- it’s must be limited or slow ( co’z for ligal issue )
How many kinds of data :-
crawler & Scraper
crawler
- it’s will open multiple links → database collection
Scraper
- it’s collect useful information → extract into [sql , xml , …]