PageRank algorithm (1.3.4 c) Flashcards

(3 cards)

1
Q

PageRank algorithm

A

is a trademarked algorithm developed by Googles founders
used to help compile and rank website pages and the list of results returned by a search engine
it works by checking the number and quality of links to a page in order to determine roughly how important that page is
the assumption is that websites are more likely to be linked to from other websites

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

PageRank algorithm breakdown

A

PR(A) - total count of outbound links from web page n including the inbound link to page A
each page has a notational vote of 1 shared between all the web pages it links to
PR(Tn)/C(Tn) - the influence/share of the vote page A gets from page T1 through Tn
each of these vote fractions are added together and multiplied by d
d - dampening factors that prevents PR(Tn)/C(Tn) from having too much influence
d is normally set to 0.85 which equates to roughly six clickthrough links

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Applying PageRank algorithm

A

a webpages PageRank is determined in part by the PageRank of other pages linking to it
in the first instance the algorithm makes an informed guess and after several further iteration the algorithm begins to home in on the correct PageRank
the number of iterations required for the final PageRank number to stop moving can be hundreds if not millions
once the final PageRank is achieved the average PageRank of all pages will be 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly