OpenIntro is a site that publishes several open-source textbooks. This page on its site compiles datasets from a variety of sources that can be used with these textbooks. The compilation includes datasets that would otherwise have disappeared from the Internet.
OpenIntro. Data Sets: Textbook data sets plus more. Available in HTML format.