archive.today

From Fanlore
Jump to: navigation, search
Website
Name: archive.today
Owner/Maintainer: Denis Petrov
Dates: May 16, 2012 – present
Type: storage
Fandom: The Internet
URL: https://archive.today/
https://archive.md/
https://archive.ph/
Click here for related articles on Fanlore.

Stub: This article is a stub. Please help us out by adding more content.

archive.today (formerly archive.is) is an archive site which stores snapshots of web pages.[1] It retrieves one page at a time similar to WebCite, smaller than 50MB each, but with support for JavaScript-heavy sites such as Google Maps and progressive web applications such as Twitter.

Archive.today records simultaneously two different 'snapshots' of a web-page. One is "Webpage" which includes any functional live links that are in the original. The other is "Screenshot" which provides a static and non-interactive visualization of the representation.[2]

History

Archive.today was founded in 2012. The site originally branded itself as archive.today, but in May 2015, changed the primary mirror to archive.is.[3]

In January 2019, it began to deprecate the archive.is domain in favor of the archive.today mirror.[4]

Owner and financing

The service does not run any advertisements and also doesn't fund raise.[5] It is not known who runs the webservice but according to the FAQs on the site it is privately funded and "there are no complex finances behind it".[5]

According to the WHOIS directory, the domain name archive.is is registered to Denis Petrov of Czechia.[6][7]

Features

Functionality

Archive.today can capture individual pages in response to explicit user requests.[8][9][10]

Web pages cannot be duplicated from archive.is to web.archive.org as second-level backup, as archive.is places an exclusion for Wayback Machine and does not save its snapshots in WARC format. The reverse—from web.archive.org to archive.is—is possible, but the copy usually takes more time than a direct capture. Some web sites get deleted from Internet Archive's listings retroactively or blocked from being saved due to their robots.txt file, but Archive.today does not use this.[11]

Once a web page is archived, it cannot be deleted directly by any Internet user.[12]

The search feature is backed by Google CustomSearch. If it delivers no results, archive.is attempts to utilize Yandex Search.[13] If a page has already been archived, archive.is asks the user to confirm archiving a new revision, instead of immediately archiving it.[14]

One can download archived pages as a ZIP file, except pages archived since 29 November 2019, when archive.today changed their browser engine from PhantomJS to Chromium.[15]

References

  1. ^ Brinkmann, Martin (22 April 2015). "Create publicly available web page archives with Archive.is". Ghacks. Archived from the original on 12 April 2019. Retrieved 13 June 2015.
  2. ^ Brunelle, Justin F.; Kelly, Mat; Weigle, Michele C.; Nelson, Michael L. (25 January 2015). "The impact of JavaScript on archivability" (PDF). International Journal on Digital Libraries. 17 (2): 95–117. doi:10.1007/s00799-015-0140-8. S2CID 8433375. Archived (PDF) from the original on 27 May 2019.
  3. ^ "Why did you change the URL back from archive-today to archive-is?". Archive.is Blog. 3 May 2015. Archived from the original on 1 June 2015. Retrieved 6 January 2019.
  4. ^ @archiveis (4 January 2019). "Please do not use archive.IS mirror for linking, use others mirrors [.TODAY .FO .LI .VN .MD .PH. .IS might stop working soon"] (Tweet). Archived from the original on 6 January 2019 – via Twitter.
  5. ^ a b https://archive.ph/faq
  6. ^ "Denis Petrov at Website Informer". website.informer.com. Archived from the original on 2022-04-29.
  7. ^ "Whois archive.is". www.whois.com. Archived from the original on 2022-04-29.
  8. ^ Dascalescu, Dan (18 February 2013). "Web page archiving – Dan Dascalescu's Wiki (review)". Wiki.dandascalescu.com. Archived from the original on 22 September 2013. Retrieved 3 October 2013.[Dead link]
  9. ^ Koebler, Jason (29 October 2014). "Dear GamerGate: Please Stop Stealing Our Shit". Motherboard. Archived from the original on 27 May 2019. Retrieved 22 March 2017. There is no way for a website to protect itself from having an Archive.today user mirror the site.
  10. ^ "archive.is/faq". archive.is. Retrieved 15 February 2019.
  11. ^ "Archive.today FAQ". archive.ph. Archived from the original on 2022-06-23.
  12. ^ "Some Frequently Asked Question". archive.is. 24 January 2013. Archived from the original (blog) on 26 September 2013. Retrieved 12 November 2018.
  13. ^ "Just realized that I can search for keywords in the search bar for archive today, was this a recently added feature?". Archive.is blog. Archived from the original on 2022-04-29. Retrieved 2022-01-27.
  14. ^ Archiving Websites with the Archive.is, retrieved 27 January 2022
  15. ^ "Archive.is blog". 17 July 2020. Archived from the original on 3 October 2020.