Ghost Archive
   HOME

TheInfoList



OR:

List of known web archive services in-use on English Wikipedia. Sorted roughly by number of uses from most to least. The
Wayback Machine The Wayback Machine is a digital archive of the World Wide Web founded by the Internet Archive, a nonprofit based in San Francisco, California. Created in 1996 and launched to the public in 2001, it allows the user to go "back in time" and see ...
is about 80% of the total. Data initially compiled by User:GreenC as of March 2017. Updates and corrections welcome.


Archive services


Internet Archive Wayback Machine

*Article:
Wayback Machine The Wayback Machine is a digital archive of the World Wide Web founded by the Internet Archive, a nonprofit based in San Francisco, California. Created in 1996 and launched to the public in 2001, it allows the user to go "back in time" and see ...
*Domain: archive.org, waybackmachine.org *Launched: 2001 *Date range: 1996- *Hostname: , web, wayback, liveweb, www, www.web, classic-web, web-beta, replay, replay.web, web.wayback *Path: , web *Timestamp: Number 1 digit; 4–14 digits. Or "*". Or "?". Or combination. May also contain trailing chars like "re_" for (?), "if_" for frames and "im_" for images. If timestamp missing returns best available page. *Examples: ::* http://www.web.archive.org//http.. ::* http://web.archive.org/web//http.. ::* http://wayback.archive.org/http.. ::* http://web.waybackmachine.org/20081212010700/http.. *Oldest: ::* http://web.archive.org/web/0/http.. ::* http://web.archive.org/web/1/http.. *Newest: ::* http://web.archive.org/web/2/http.. *Index: ::* http://web.archive.org/web/*/http.. * Submit page: ::* http://web.archive.org/save/http.. ::* http://web.archive.org/record/http.. * Interval between manual captures: 1 hour. * Prefix searching (except after the question mark in query strings)


Archive.Today

*Article:
archive.today archive.today (or archive.is) is a web archiving site, founded in 2012, that saves snapshots on demand, and has support for JavaScript-heavy sites such as Google Maps and progressive web apps such as Twitter. archive.today records two snaps ...
*Domain: .today, .is, .fo, .li, .vn, .md, .ph *Launched: 2012 *Hostname: , www *Path: *Timestamp: 4–14 digits; or digits + characters (see example) *Examples: ::* http://archive.is/20130101/http.. ::* http://archive.is/2013.04.17-12:08:20/http.. ::* http://archive.is/http.. (index page) * Submit page: http://archive.today/?run=1&url=... (requires JavaScript) * Interval between manual captures: 1 week. * Prefix searching with preview screenshots * Page history with preview screenshots Archive.Today represents captured pages as a static snapshot, rendered by the Archive.Today server, and uses a fixed-width layout. Page resources such as JavaScript and CSS files are not retained separately. For example, styling from a separate CSS file is converted to inline CSS styling, embedded in the HTML source code. Archived pages are initially served through their short URL format, an identifier with five case-sensitive alphanumerical characters and four characters on early captures from 2012. To obtain the long URL format with time stamp and the source URL, click "share" in the top menu or append "/share" to the URL. The full URL is listed in the window. If a redirect page is saved, Archive.Today stores both the URL of the redirect page and the URL of the redirect target. The archived page can be found by entering either URL. ;Additional restrictions As of 2023, copies of Archive.org pages can only be saved once. This restriction applies as well to the digital library (archive.org/details/) which is subject to change, not only the Wayback Machine where pages are, besides infrequent exclusions, not subject to change anyway. If a "Welcome to nginx!" page appears, it apparently either means the user has hit a rate limit or the site is doing maintenance work.


Web Citation (WebCite)

*Article: WebCite *Domain: webcitation.org :Deprecated—no longer accepting new archive requests. Site generally unstable, abandoned, and features not working *Hostname: , www *Path: base62ID, query, cache, getfile.php, *Timestamp: None. Uses &date=2012-06-01+21:40:03 in query?url ; the short ID is base62 which converts to unix time *Examples: ::* http://www.webcitation.org/gT64fd ::* http://www.webcitation.org/66lmEkpE8?url=http://www.ariacharts.com.au/pages/charts_display_album.asp?chart%3D1G50 ::* http://www.webcitation.org/query?id=1138911916587475 ::* http://www.webcitation.org/query?url=http..&date=2012-06-01+21:40:03 ::* http://www.webcitation.org/1138911916587475 ::* http://www.webcitation.org/cache/73e53dd1f16cf8c5da298418d2a6e452870cf50e ::* http://www.webcitation.org/getfile.php?fileid=1c46e791d68e89e12d0c2532cc3cf629b8bc8c8e


National Archives UK

*Domain: nationalarchives.gov.uk *Hostname: webarchive, yourarchives *Path: *Timestamp: 4–14 digits *Examples: ::* http://webarchive.nationalarchives.gov.uk/20110311030350/http://www.abilityvability.co.uk/ ::* http://yourarchives.nationalarchives.gov.uk/index.php?title=Backbone_radio_link_and_radio_standby_to_line_links_for_safeguarding_vital_communications


Australian Web Archive

*Article: Australian Web Archive (
Trove Trove is an Australian online library database owned by the National Library of Australia in which it holds partnerships with source providers National and State Libraries Australia, an aggregator and service which includes full text document ...
) *Domain: nla.gov.au *Hostname: webarchive *Path: see examples. First-level path can contain "awa" or "wayback". Second-level path might contain a pandora.nla.gov.au URL with a third-level destination URL which may or may not have a scheme (http://). Or the second-level path might be the final destination URL. A special archive index page URL is also available as "/tep/" *Timestamp: Two types (20120727-0512, 20120326012340) *Examples: ::*https://webarchive.nla.gov.au/awa/20120726200849/http://pandora.nla.gov.au/pan/14231/20120727-0512/www.howlspace.com.au/en2/inxs/inxs.htm ::*https://webarchive.nla.gov.au/awa/20110824211656/http://pandora.nla.gov.au/pan/128344/20110810-1451/www.theaureview.com/guide/festivals/bam-festival-2010-ivorys-rock-qld.html ::*https://webarchive.nla.gov.au/awa/20010328130000/http://www.howlspace.com.au/en2/arenatina/arenatina.htm ::*https://webarchive.nla.gov.au/wayback/20120326012340/http://news.defence.gov.au/2011/09/09/army-airborne-insertion-capability/ ::*https://webarchive.nla.gov.au/gov/20070831165847/http://www.defence.gov.au/opacolyte/default.cfm ::*https://webarchive.nla.gov.au/tep/23790 (redirect from http://pandora.nla.gov.au/tep/23790 ) .. index page :Note: The Australian Web Archive incorporates the Pandora archive as well as the Australian Government Web Archive and the
National Library of Australia The National Library of Australia (NLA), formerly the Commonwealth National Library and Commonwealth Parliament Library, is the largest reference library in Australia, responsible under the terms of the ''National Library Act 1960'' for "mainta ...
's archive of the domain. :Note: No memento access


NLA Australia (deprecated)

: Deprecated. Integrated into Australian Web Archive (Trove) above. *Domain: nla.gov.au *Hostname: pandora, trove, tep, webarchive, content.webarchive *Path: see examples. The /pan/ regex should be /pan/ -9 *Timestamp: Three types (20120727-0512, S2000-Dec-5, 20120326012340) *Examples: ::*http://pandora.nla.gov.au/pan/14231/20120727-0512/www.howlspace.com.au/en2/inxs/inxs.htm ::*http://pandora.nla.gov.au/pan/128344/20110810-1451/www.theaureview.com/guide/festivals/bam-festival-2010-ivorys-rock-qld.html ::*http://pandora.nla.gov.au/nph-wb/20010328130000/http://www.howlspace.com.au/en2/arenatina/arenatina.htm ::*http://pandora.nla.gov.au/nph-arch/2000/S2000-Dec-5/http://www.paralympic.org.au/athletes/athleteprofile60da.html ::*http://pandora.nla.gov.au/tep/23790 ::*http://webarchive.nla.gov.au/gov/20120326012340/http://news.defence.gov.au/2011/09/09/army-airborne-insertion-capability/ ::*http://content.webarchive.nla.gov.au/gov/wayback/20120326012340/http://news.defence.gov.au/2011/09/09/army-airborne-insertion-capability *Note: Not to be confused with non-webarchive URLs that appear similar: ::*http://pandora.nla.gov.au/pan/23790/20080220-0000/issue935.pdf ::*http://nla.gov.au/nla.obj-291093018/view?partId=nla.obj-291093220 ::*http://trove.nla.gov.au/ndp/del/article/33084123 :Note: No memento access


Ghost Archive

* Domain: ghostarchive.org * Launched: ~2021 * Hostname:
one 1 (one, unit, unity) is a number representing a single or the only entity. 1 is also a numerical digit and represents a single unit of counting or measurement. For example, a line segment of ''unit length'' is a line segment of length 1. I ...
* Path: archive, varchive/, iarchive/ * Timestamp: 4 to 14 digits * Examples: ::* https://ghostarchive.org/archive/fwAS7 (short-form) ::* https://ghostarchive.org/archive/20210728022510/https://rms-support-letter.github.io/ ::* https://ghostarchive.org/varchive/UhCiGY75wVw (short form) ::* https://ghostarchive.org/varchive/youtube/20100711020000/UhCiGY75wVw ::* https://ghostarchive.org/iarchive/instagram/georgemofficial/1374848874216391600 ::* https://ghostarchive.org/iarchive/s/instagram/BMUccRPg4Ow ::* https://ghostarchive.org/archive/20210728022510/https://www.instagram.com/p/BMUccRPg4Ow/ :Note: to convert short-form to long: ::For regular web pages: ::* https://ghostarchive.org/longurl/o9UcZ -> https://ghostarchive.org/archive/20210728022510/https://rms-support-letter.github.io/ ::For video pages, e.g. YouTube: ::* http://ghostarchive.org/vlongurl/UhCiGY75wVw -> https://ghostarchive.org/varchive/youtube/20100711020000/UhCiGY75wVw :To find the earliest and latest archive available use a timestamp of "1990" or "3000" e.g. :* https://ghostarchive.org/archive/1990/https://rms-support-letter.github.io/ will find the earliest archived copy of that webpage, while :* https://ghostarchive.org/archive/3000/https://rms-support-letter.github.io/ will find the latest. : The /archive/ path long form will work for all types of archives, for example https://ghostarchive.org/archive/20210728022510/https://youtube.com/watch?v=UhCiGY75wVw will redirect to the video, and https://ghostarchive.org/archive/20210728022510/https://www.instagram.com/p/BMUccRPg4Ow/ will redirect to the image. * Prefix searching with preview screenshots * Page history with preview screenshots Ghost Archive uses the WARC ("webarchive") format to store saved pages, meaning the verbatim content of the page resources can be recreated. When opened, Ghost Archive uses the
Webrecorder Rhizome is an American not-for-profit arts organization that supports and provides a platform for new media art. History Artist and curator Mark Tribe founded Rhizome as an email list in 1996 while living in Berlin./@ or /c format, whichever is available. If the archived page redirects to a different URL, only the target URL is displayed. This means the archived page can not be opened by entering the URL of the redirecting page.


Megalodon.jp

* Site name: web gyotaku * Date range: 2007- * Article: Megalodon (website) * Domain: megalodon.jp * Examples: https://megalodon.jp/2023-0522-0234-30/https://gstreamer.freedesktop.org:443/download/ * No prefix searching Similar to Archive.Today, Megalodon.jp represents archived pages as a static HTML snapshot. However, pictures are converted into BASE64 data: URLs inside the resulting HTML data, and there is no fixed width like Archive.Today . Megalodon lets the user decide whether to save the desktop or mobile version of a page, meaning the version that appears to desktop computer and laptop users, or to smartphone users. Using https://megalodon.jp/(full URL) (example: https://megalodon.jp/https://gstreamer.freedesktop.org:443/download/ ) can check if Megalodon archived any copy of a particular URL. http and https are treated separately. If the archived page is a redirect to a different URL, only the URL prior to the redirect is saved. In that case, the archived page can not be opened by entering the target URL
example
.


FreezePage

*Domain: freezepage.com *Hostname: , www *Path: *Timestamp: (only available via web scrape) *Examples: ::*http://www.freezepage.com/1338238555ICJBKARMZN ::*http://www.freezepage.com/1343081512QUPLJKJOYU?url=http://www.telegraph.co.uk/.. :Note: If the account ID which created the snapshot expires for lack of activity (no login to freezepage), the snapshot is deleted from freezepage.com :Note: No memento access


Library of Congress

*Domain: loc.gov *Hostname: webarchive *Path: all, lcwa#### *Timestamp: 4–14 digits *Examples: ::* http://webarchive.loc.gov/all/20160110110238/https://www.whitehouse.gov/ ::* http://webarchive.loc.gov/lcwa0010/20111109051100/http


Arquivo.pt (Portugal)

*Domain: arquivo.pt *Hostname: *Path: wayback, wayback/wayback, noFrame/replay *Timestamp: 4–14 digits ... might contain "mp_" see example *Examples: ::* http://arquivo.pt/wayback/19980205082901/http://www.caleida.pt/saramago/ ::* http://arquivo.pt/wayback/wayback/20091010102944/http.. ::* http://arquivo.pt/noFrame/replay/20091010102944mp_/http..


Stanford University web archive

*Domain: stanford.edu *Hostname: swap *Path: was (optional but standard) *Timestamp: 4–14 digits *Examples: ::* https://swap.stanford.edu/was/19940102000000/http://slacvm.slac.stanford.edu/FIND/slac.html ::* https://swap.stanford.edu/19940102000000/http://slacvm.slac.stanford.edu/FIND/slac.html ::* https://sul-swap-prod.stanford.edu/19940102000000/http://slacvm.slac.stanford.edu/FIND/slac.html deprecated


Archive-It

*Domain: archive-it.org *Hostname: wayback *Path: "all", a 3–5 digit number; "org-" followed by a 3–4 digit number *Timestamp: 4–14 digits; "0" or "1" for oldest; "2" for newest; "*" for index *Examples: ::* https://wayback.archive-it.org/all/20190621232545/http://example.com/ ::* https://wayback.archive-it.org/3348/20151201214156/https://www.heritagepreservation.org/ ::* https://wayback.archive-it.org/org-467/20191016094633/http://quartos.org/ *Oldest: ::* https://wayback.archive-it.org/all/1/http://example.com/ ::* https://wayback.archive-it.org/3348/1/https://www.heritagepreservation.org/ ::* https://wayback.archive-it.org/org-467/1/http://quartos.org/ *Newest: ::* https://wayback.archive-it.org/all/2/http://example.com/ ::* https://wayback.archive-it.org/3348/2/https://www.heritagepreservation.org/ ::* https://wayback.archive-it.org/org-467/2/http://quartos.org/ *Index: ::* https://wayback.archive-it.org/all/*/http://example.com/ ::* https://wayback.archive-it.org/3348/*/https://www.heritagepreservation.org/ ::* https://wayback.archive-it.org/org-467/*/http://quartos.org/


BibAlex

*Domain: bibalex.org:80 *Hostname: web.archive, web.petabox *Path: web *Timestamp: 4–14 digits *Examples: ::* http://web.archive.bibalex.org/web/20051231070651/http://www.heimskringla.no/original/heimskringla/ynglingasaga.php ::* https://web.petabox.bibalex.org/web/20060521125008/http://developmentgap.org/rmalenvi.html *Portal entry: https://www.bibalex.org/isis/frontend/archive/archive_web.aspx *Example URLs above are down as of March 2024. Might be temporary.


WikiWix

*Domain: wikiwix.com *Hostname: archive *Path: cache *Timestamp: 4–14 digits *Examples: ::*https://archive.wikiwix.com/cache/20180329074145/http://www.linterweb.fr ::*https://archive.wikiwix.com/cache/?url=http://www.linterweb.fr :Note: Does not support Memento :Note: API access added in March 2018. By appending &apiresponse=1 to the end of the URL. (https://archive.wikiwix.com/cache/?url=http://www.linterweb.fr&apiresponse=1). This may require encoding of any other & in the url= section :Note: Supports &title argument at end of URL not part of the source URL (similar to &apiresponse). Gives the name of the Wikipedia article the link is being used in (optional).


National Archives US

*Domain: webharvest.gov *Hostname: *Path: *Timestamp: 4–14 digits *Examples: ::*http://webharvest.gov/peth04/20041022004143/http://www.ftc.gov/os/statutes/textile/alerts/dryclean


National Archives Iceland

*Domain: vefsafn.is *Hostname: wayback *Path: wayback *Timestamp: 4–14 digits *Examples: ::* http://wayback.vefsafn.is/wayback/20110318105639/http://www.twitter.com/yagirldwoods


Europa Archives, Ireland (deprecated)

: Deceased. In May 2018 all archives were moved to collections.internetmemory.org then as of September 2018, all archives were moved again to Archive-I

*Domain: europarchive.org *Hostname: collection *Path: nli *Timestamp: 4–14 digits *Move example: :*Original: http://collection.europarchive.org/nli/20141013204117/http://www.defense.gov/ :*Move 1: http://collections.internetmemory.org/nli/20141013204117/http://www.defense.gov/ :*Move 2: http://wayback.archive-it.org/10702/20141013204117/http://www.defense.gov/


Perma CC

*Domain: perma-archives.org, perma.cc *Hostname: *Path: , warc *Timestamp: 4–14 digits for perma-archives.org, or snapshot ID *Examples: ::* http://perma-archives.org/warc/20140729143852/http ::* http://perma.cc/F9NT-22AK ::* https://perma-archives.org/warc/F9NT-22AK/http://www.goduke.com/ViewArticle.dbml?SPSID=25943&SPID=2027&DB_LANG=C&DB_OEM_ID=4200&ATCLID=152476


Proni Web Archives (deprecated)

: Deceased. As of October 2018, all archives moved to Archive-I

*Domain: proni.gov.uk *Hostname: webarchive *Path: *Timestamp: 4–14 digits *Examples: ::* http://webarchive.proni.gov.uk/20111213123846/http * Move example: :*Original: http://webarchive.proni.gov.uk/20100218151844/http://www.berr.gov.uk/ :*New: http://wayback.archive-it.org/11112/20100218151844/http://www.berr.gov.uk/


Parliament UK

*Domain: parliament.uk *Hostname: webarchive *Path: *Timestamp: 4–14 digits *Examples: ::* http://webarchive.parliament.uk/20160204060058tf_/http://www.parliament.uk/about/living-heritage/building/palace/big-ben/


UK Web Archive (British Library)

*Domain: webarchive.org.uk *Hostname: www *Path: wayback/archive *Timestamp: 4–14 digits with possibility of "mp_" at end *Examples: ::* http://www.webarchive.org.uk/wayback/archive/20100602000217/www.westsussex.gov.uk/ccm/navigation/your-council/election ::* https://www.webarchive.org.uk/wayback/archive/20151128210021mp_/http://newsroom.herefordshire.gov.uk/2006/november/new%2Dsculpture%2Dto%2Dbe%2Dhanded%2Dover.aspx


Libraries and Archives Canada (deprecated)

:Deceased. As of May 2018 all archives moved to webarchive.bac-lac.gc.c

*Domain: collectionscanada.gc.ca *Hostname: www *Path: archivesweb, webarchives *Timestamp: 4–14 digits *Examples: ::* http://www.collectionscanada.gc.ca/webarchives/20061104084225/http://broadband.gc.ca/maps/province.html?prov=48 ::* http://www.collectionscanada.gc.ca/archivesweb/20060209004933/http *Note: Not to be confused with other close URL variants. Only capture "/webarchives/" or "/archivesweb/" * Move example: :*Original: http://www.collectionscanada.gc.ca/webarchives/20061104084225/http://broadband.gc.ca/maps/province.html?prov=48 :*New: http://webarchive.bac-lac.gc.ca:8080/wayback/20061104084225/http://broadband.gc.ca/maps/province.html?prov=48


Libraries and Archives Canada (www.bac-lac.gc.ca)

:As of September 2022 the new website is https://library-archives.canada.ca/eng - most of the www.bac-lac.gc.ca may no longer be available but some links still work see

*Domain: bac-lac.gc.ca:8080 *Hostname: webarchive, www *Path: wayback *Timestamp: 4–14 digits *Examples: ::* http://webarchive.bac-lac.gc.ca:8080/wayback/20051228174058/http://nationalatlas.gov/ *Note: Formerly collectionscanada.gc.ca see above. As of September 2022, links from collectioncanada.gc.ca were entirely removed by LA


Catalonian Archive

*Domain: padi.cat(:8080)? *Hostname: www, (none) *Path: wayback *Timestamp: 4–14 digits *Examples: ::* http://www.padi.cat:8080/wayback/20140404212712/http ::* http://www.padi.cat/wayback/20140404212712/http


Web Archives Singapore

*Domain: nlb.gov.sg *Hostname: eresources *Path: webarchives/wayback *Timestamp: 4–14 digits *Examples: ::* https://eresources.nlb.gov.sg/webarchives/2016-04-25%2019:07:06.000/wp/details/http://www.lta.gov.sg/apps/news/page.aspx?c=2&id=2dzk9l67sx9j40a1rhgdw3hvhrnxgq3zh34l77r37dj4w72jf1 ::*https://eresources.nlb.gov.sg/webarchives/wayback/20160425174854/https://www.lta.gov.sg/apps/news/page.aspx?c=2&id=2dzk9l67sx9j40a1rhgdw3hvhrnxgq3zh34l77r37dj4w72jf1 *Note: Not to be confused with other close URL variants. Only capture "/webarchives/wayback/"


Slovenian Archives (Spletni)

*Domain: nuk.uni-lj.si:8080 *Hostname: nukrobi2 (may change) *Path: wayback *Timestamp: 4–14 digits *Examples: ::* http://nukrobi2.nuk.uni-lj.si:8080/wayback/20160203130917/http


Estonia Archives

*Domain: digar.ee *Hostname: veebiarhiiv *Path: a *Timestamp: 4–14 digits *Examples: ::* http://veebiarhiiv.digar.ee/a/20131014091520/http://rakvere.kovtp.ee/en_GB/twin-cities


Bavarian Archives

*Domain: bib-bvb.de *Hostname: langzeitarchivierung *Path: wayback *Timestamp: 4–14 digits *Examples: ::* http://langzeitarchivierung.bib-bvb.de/wayback/20121004142737/http://www.schwabenkrieg.historicum-archiv.net/


York University Digital Library

*Domain: yorku.ca *Hostname: digital.library *Path: wayback *Timestamp: 4–14 digits *Examples: ::* https://digital.library.yorku.ca/wayback/20160129214328/http://en.cijnews.com/?p%3D10033


National Library of Israel

* Domain: wayback.nli.org.il * Path: '''' * Timestamp: 14 digits * Format: http://wayback.nli.org.il// Appears to have poor coverage.


Other

;Memento Web *https://timetravel.mementoweb.org/memento/2010/http://www.muslimdirectory.co.uk/displayresults.php?PHPSESSID=f0fb8b41d8758983e7d43cddb556b9df&businesstype=1&orgtype=&country=UK&city=Cardiff :Note: Redirects to an external archive service based on cached data in the Memento database which can fluctuate and/or be inaccurate due to the cache going out of sync with the client service. ;
Google Cache Search engine cache is a cache of web pages that shows the page as it was when it was indexed by a web crawler. Cached versions of web pages can be used to view the contents of a page when the live version cannot be reached, has been altered or t ...
(ephemeral) *http://webcache.googleusercontent.com/search?q=cache:http://www.gapan.org/ruth-documents/Masters%2520Medal%2520%2520Press%2520Release.pdf :Note: Links quickly expire. :Note: Cannot be accessed with Memento. ; Bing Cache (ephemeral) Only accessible through search results, not manually through an URL or search prefix.


Undocumented

* https://cachedview.nl/ * https://cachedview.com/ * http://www.cachedpages.com/ * https://commoncrawl.org/ * https://www.bravenewtech.org/ "Video Vault" * https://conifer.rhizome.org/ * https://archive.st/


See also

* List of Web archiving initiatives * Help:Archiving a source


References

{{Reflist