Skip to main content

View Post [edit]

Poster: TwoNucker Date: Oct 9, 2021 4:09pm
Forum: general Subject: .

This post was modified.
This post was modified by TwoNucker on 2021-10-09 23:09:32

Reply [edit]

Poster: SomebodySmart Date: Oct 9, 2021 1:14pm
Forum: general Subject: Re: Congressional websites

But my point is that Internet Archive keeps trying to crawl those Members' websites after they have been taken down. That wastes Internet Archive resources and the Congressional website resources by downloading their "Page Not Found" notices.

My point is that archive.org should continue to display the pages that were crawled while the member was in office and there was material at those URL's, but should stop crawling shortly after the website is taken down.

As information, Congressional websites seem to be subdomains now, and instead of http://www.house.gov/member they would be https://member.house.gov

As time goes on and members leave Congress, and their websites get shut down, I do not know if they re-use a domain for a new member with the same surname.

This post was modified by SomebodySmart on 2021-10-09 20:13:58
This post was modified by SomebodySmart on 2021-10-09 20:14:24