

LaTeX code for Archive-IT

\documentclass[a4paper,11pt]{article}
\usepackage{ulem}
\usepackage{a4wide}
\usepackage[dvipsnames,svgnames]{xcolor}
\usepackage[pdftex]{graphicx}

\usepackage{hyperref}
% commands generated by html2latex


\begin{document}
\subsection{Contents}
\begin{itemize}
	\item \hyperlink{Archive-It_Project_Plan}{1 Archive-It Project Plan}
\begin{itemize}
	\item \hyperlink{Step_One}{1.1 Step One}
	\item \hyperlink{Step_Two}{1.2 Step Two}
	\item \hyperlink{Step_Three}{1.3 Step Three}
\end{itemize}
\end{itemize}
\hypertarget{Archive-It_Project_Plan}{}

\subsection{ Archive-It Project Plan }

Archive-It is a web archiving service developed by the Internet Archive ``that helps organizations to harvest, build, and preserve collections of digital content.'' (\href{http://www.archive-it.org/learn-more}{[1]})

The University of Alberta Libraries has an Archive-It subscription, and our project for collecting content related to the history of Humanities Computing is the first research collaboratory at the University of Alberta to use this service.\hypertarget{Step_One}{}

\subsubsection{Step One}
\begin{itemize}
	\item  Before our project team begins crawling and collecting content, we need to run practice test crawls to become familiar with the service and to learn how the various crawl parameters affect the breadth and scope of what is collected.
	\item  Our test crawl will begin with an item of historical interest that was active in a specific time frame, such as a newsletter or report.
	\item  Following this test crawl, we will see whether we can extract data from the crawl results and whether any of the text analysis tools at our disposal can be applied to it. If so, we will continue with the project.
	\item  Before beginning any crawl, we will need to check the Wayback Machine to make sure the Internet Archive isn't already crawling the content; we do not want to duplicate existing crawls.
\end{itemize}\hypertarget{Step_Two}{}

\subsubsection{Step Two}
\begin{itemize}
	\item  We will identify ten different types of sites to crawl. For example:
\begin{itemize}
	\item  Institute
	\item  Journal
	\item  Technological
	\item  Blogs (conference notes)
	\item  Tweets (hashtags)
	\item  Events
\end{itemize}
\end{itemize}\hypertarget{Step_Three}{}

\subsubsection{Step Three}
\begin{itemize}
	\item  We will evaluate the project and determine whether:
\begin{itemize}
	\item  (1) We would like to continue making and analyzing crawls; and
	\item  (2) We should continue using a portion of the Library's subscription or purchase our own subscription to Archive-It from the Internet Archive.
\end{itemize}
	\item  We will share our experience of collaborating with the Library with other members of the campus community and with broader audiences.
\end{itemize}

\end{document}