Analogy - Internet as a directed graph + practical importance for modelling the web
sites hyperlink to other sites –> sometimes two-way, sometimes one-way
serves as the basis for web discoverability + page ranking for search results
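Sketch of the analogy (assumed toy data, site names are made up): sites are nodes, hyperlinks are directed edges, stored as an adjacency mapping. The helper names `is_two_way` and `inbound_links` are illustrative only.

```python
# Toy web graph: each key is a site, each value is the set of sites it links to.
# All site names are hypothetical.
links = {
    "a.example": {"b.example", "c.example"},
    "b.example": {"a.example"},  # a <-> b: two-way link
    "c.example": set(),          # a -> c: one-way link, c links to nobody
}

def is_two_way(graph, src, dst):
    """True when src links to dst and dst links back to src."""
    return dst in graph.get(src, set()) and src in graph.get(dst, set())

def inbound_links(graph, target):
    """Sites that link to target, i.e. the edges pointing into it."""
    return sorted(src for src, outs in graph.items() if target in outs)
```

`inbound_links` is the direction that matters for ranking: it answers "who points at this page", which a site cannot read off its own outgoing links.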
web crawlers
aka bots/spiders
good web crawler
bad web crawler
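Sketch of what a crawler does at its core, assuming the web is an in-memory link graph (no network access, page names are made up): follow links breadth-first from a start page. A real crawler would fetch and parse pages; `crawl` and `toy_web` are illustrative names.

```python
from collections import deque

def crawl(graph, start):
    """Breadth-first visit of every page reachable from start via links."""
    seen = {start}
    queue = deque([start])
    order = []
    while queue:
        page = queue.popleft()
        order.append(page)
        # Sort so the visiting order is deterministic in this toy example.
        for nxt in sorted(graph.get(page, ())):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return order

# Hypothetical pages; "orphan" links out but nothing links to it.
toy_web = {
    "home": ["about", "blog"],
    "about": ["home"],
    "blog": ["post1"],
    "post1": [],
    "orphan": ["home"],
}

visited = crawl(toy_web, "home")
```

Here `visited` never contains "orphan": with no inbound links, a crawler that only follows hyperlinks has no path to it, which is why links drive discoverability.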
Web indexing explained + factors influencing (TRUQD)
Analysis of web content to classify website –> use data to shape web results
factors:
- website trustworthiness
- content readability
- content uniqueness
- content quality
- duplication of existing content
TRUQD
Website is crawled, analyzed and indexed –> how is the index info stored
search engine stores keywords + sequence of appearance + frequency of each –> used to gauge relevance to different topics
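Sketch of one way to store that index info, assuming a simple positional inverted index (real search engines are far more elaborate; page ids and text are made up): each keyword maps to the pages it appears on and the word positions there, so frequency is just the number of positions.

```python
import re
from collections import defaultdict

def build_index(pages):
    """Map keyword -> {page: [word positions]}; frequency = len(positions)."""
    index = defaultdict(dict)
    for page, text in pages.items():
        words = re.findall(r"[a-z0-9]+", text.lower())
        for pos, word in enumerate(words):
            index[word].setdefault(page, []).append(pos)
    return index

# Hypothetical page contents.
pages = {
    "p1": "Graph theory and web graph basics",
    "p2": "Cooking basics",
}
index = build_index(pages)

graph_positions = index["graph"]["p1"]    # sequence of appearance on p1
graph_frequency = len(graph_positions)    # how often "graph" appears on p1
```

Looking up `index["basics"]` returns both pages, while `index["graph"]` returns only p1 with two hits, which is the kind of signal used to judge that p1 is more relevant to graph topics.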
assessing content quality (SHERMIUQ)
SHERMIUQ
(social, HTML, Engag, Rel, Mobile, Import, Upda, Quality)
page ranking - Hyperlinking + factors explained
number of redirects + trustworthiness/popularity of redirects + web engagement
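Sketch of link-based ranking using a simplified PageRank-style iteration. This is an assumption: the notes do not name the algorithm, and web engagement is not modelled here. It shows the two link factors together: more inbound links raise a page's score, and a link from a popular page is worth more than one from an obscure page. The graph, the damping value 0.85, and the function name are illustrative.

```python
def page_rank(graph, damping=0.85, iterations=50):
    """Simplified PageRank by power iteration; every link target must be a key."""
    pages = list(graph)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page gets a small baseline share regardless of links.
        new = {p: (1.0 - damping) / n for p in pages}
        for src, outs in graph.items():
            if outs:
                # A page splits its current rank evenly across its outgoing links.
                share = damping * rank[src] / len(outs)
                for dst in outs:
                    new[dst] += share
            else:
                # Page with no outgoing links: spread its rank over all pages.
                for dst in pages:
                    new[dst] += damping * rank[src] / n
        rank = new
    return rank

# Hypothetical graph: "hub" is linked to by a, b and c; nothing links to c.
toy = {
    "hub": ["a", "b"],
    "a": ["hub"],
    "b": ["hub"],
    "c": ["hub"],
}
ranks = page_rank(toy)
```

In this toy graph "hub" ends up with the highest score (most inbound links), "a" and "b" tie because each receives one link from the popular hub, and "c" ranks lowest because nothing links to it.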
SEO - premise
Search engine optimization
- 3rd party company hired to increase web traffic to a website
- SEO reverse engineers the search algo –> determines what factors improve page rank
blackhat/evil SEO
black hat SEO sabotage
send bad traffic to competitors
eg redirects from shady/sketchy sites –> degrade page rank