What Is Web Scraping

  • It is used to get data from web pages.
  • Web scraping is also used to convert unstructured data into structured data.
  • Types of data:
    • Structured data - data in a table format (as in a database), or another directly usable format
    • Unstructured data - audio, images, PDFs, links, etc.
  • Types of web scrapers:
    • Self-built - you make your own
    • Pre-built
    • Browser extensions
    • Software
    • Cloud
  • Rules:
    • Scraping requires permission; otherwise it can be illegal.
    • Go to the website and add /robots.txt to its address; the file shows what is permitted.
    • Requests must be limited or slow (because of legal issues).
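
The two rules above (check /robots.txt, keep requests slow) can be sketched with Python's standard `urllib.robotparser`. This is only a minimal sketch: the robots.txt text, the `my-bot` user agent, the example.com URLs, and the helper names `build_parser`, `allowed_urls`, `polite_delay`, and `visit_politely` are all assumptions made for illustration, not part of these notes.

```python
import time
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, inlined so the sketch runs offline.
# For a real site you would call set_url(".../robots.txt") and then read().
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 2
"""

def build_parser(robots_text):
    """Parse robots.txt text into a RobotFileParser."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser

def allowed_urls(parser, user_agent, urls):
    """Keep only the URLs that robots.txt permits for this user agent."""
    return [u for u in urls if parser.can_fetch(user_agent, u)]

def polite_delay(parser, user_agent, default=1.0):
    """Use the site's Crawl-delay if it declares one, else a default pause."""
    delay = parser.crawl_delay(user_agent)
    return float(delay) if delay is not None else default

def visit_politely(urls, delay, fetch, sleep=time.sleep):
    """Call fetch(url) for each URL, pausing `delay` seconds between calls."""
    results = []
    for i, url in enumerate(urls):
        if i > 0:
            sleep(delay)  # slow down between requests
        results.append(fetch(url))
    return results

parser = build_parser(ROBOTS_TXT)
urls = [
    "https://example.com/index.html",
    "https://example.com/private/data.html",
]
permitted = allowed_urls(parser, "my-bot", urls)
delay = polite_delay(parser, "my-bot")
```

Here `permitted` keeps only the first URL, because `/private/` is disallowed, and `delay` comes from the `Crawl-delay` line. The `fetch` argument of `visit_politely` is left to the caller, so the same loop works with whichever download function is used.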

How Many Kinds of Data

Crawler & Scraper

Crawler

  • It opens multiple links to build up a database collection.

Scraper

  • It collects useful information and extracts it into formats such as [sql, xml, …].
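
To make the crawler/scraper split concrete, here is a small sketch using only Python's standard library. The inlined HTML page, the class names `LinkCollector` and `TableScraper`, the `to_csv` helper, and the choice of CSV as the output format are assumptions for illustration; the notes themselves only mention formats like sql and xml.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical page, inlined so the sketch runs without network access.
PAGE = """
<html><body>
  <a href="/page1">One</a>
  <a href="/page2">Two</a>
  <table>
    <tr><td>apple</td><td>3</td></tr>
    <tr><td>pear</td><td>5</td></tr>
  </table>
</body></html>
"""

class LinkCollector(HTMLParser):
    """Crawler side: gather the links that would be visited next."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

class TableScraper(HTMLParser):
    """Scraper side: pull table cells out of the page into rows."""
    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_data(self, data):
        if self._in_cell:
            self._row[-1] += data.strip()

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None

def to_csv(rows, header):
    """Write the scraped rows as CSV text (one possible structured format)."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue()

crawler = LinkCollector()
crawler.feed(PAGE)

scraper = TableScraper()
scraper.feed(PAGE)
csv_text = to_csv(scraper.rows, ["name", "count"])
```

The crawler part only discovers where to go next (`crawler.links`), while the scraper part turns the page content into structured rows (`scraper.rows`) that can then be saved.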

Python Libraries & Tools