Skip to main content
Humanities LibreTexts

5.4: Finding Out When a Page Was Published Using Google

  • Page ID
    79174
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    Finding Out When a Page Was Published Using Google

    Many pages will tell you the date they were published. But some pages don’t give publication dates, and some can’t be trusted.

    Take, for example, this story from fake site ABCNews.co (a hoax site that attempts to to look like an ABC news site).

    pro.png

    You’ll note the publication date: November 11.

    That’s what the site looks like today. But we can see what it looked like previously, courtesy of archive.org’s Wayback Machine.

    Here’s what it looked like in March, sporting a publish date of March 24:

    hoax.png

    Here it is in June, sporting a date of June 16:

    june.png

    And in September it sported a date of September 11:

    september.png

     

    Hoax sites often do this date incrementation to increase the share rate on older stories. People are more likely to share things if they believe they are breaking news and not yesterday’s story.

    So how do we get some sense of when this story was first published?

    We can’t get there exactly but we can often use Google to get close. Google stores the date of the first time it indexed a page — on popular sites this date is usually within a couple days of the true publish date (on unknown sites it is much less reliable).

    To get Google to show the indexed date of a page, you do two things:

    • Set up a search that will only return that particular page by using the site: search term, and
    • Trigger display date but setting a date range that ends with the current day.

    Here’s what that looks like in this case:

    date-1.jpg

    As you can see, we’ve taken the URL of the page and entered

    site:abcnews.com.co/donald-trump-protester-speaks-out-i-was-paid-to-protest/

    as the search. And then we’ve used date filtering to crate a filter that doesn’t really exclude anything (its date range is all possible dates) but triggers this sort of date display in Google.

    Again, this is not a rock-solid publication date, but we can say that there was some content at this URL at this date, and in most cases, with a URL like this, that means the story was up by then.


    This page titled 5.4: Finding Out When a Page Was Published Using Google is shared under a CC BY license and was authored, remixed, and/or curated by Mike Caulfield.

    • Was this article helpful?