b0VIM 7.4TY@}rfisherzuul02/home/www.cv.nrao.edu/homes/rfisher/DIYanalysis/DataSci&GBTarchive.html U3210#"! UtpFfGadEyFK# 6 P  i  : ` B T  o"u*B]s'~)utcb>=yx We'd like it to contain enough information to permit data searches based on useful Compiling a table of contents for 35 terabytes of data presents a challenge.

What is in the Archive?

for one scan. '2006_08_07_00:17:45.fits' is the FITS file name, made up of the UTC date and time, is the observing session designator; 'Antenna' is the telescope subsystem; and where '06B' in the observing proposal submission sub-year (2006B); 'AGBT06B_026_03' in the archive looks like "06B/AGBT06B_026_03/Antenna/2006_08_07_00:17:45.fits"; we'll use the tarball protocol in any examples to follow. The subdirectory structure file export protocol from the archive access tool has yet to be defined so and unpacking it on your local computer. As of this writing, the exact like a "tarball" something GBT archive some of the directory structure must be maintain by creating to avoid overwriting one another. Hence, when data are extracted from the name, and the data archive relies on their being in separate subdirectories Oscillator, etc.). All of the FITS files from one scan have the same file FITS files - one for each subsystem (Antenna, Receiver, Spectrometer, Local interpret the image is in one file, one GBT scan creates a half dozen or so (FITS). Unlike a typical optical data FITS file, where everything needed to All of the GBT data files adhere to the Flexible Image Transport Standard

Structure of the Archive

programming examples of how to work with the data and extract scientific results. observations from a data science perspective. We will then delve into some archive and then look in some detail at examples of the different types of these web pages we're going to start with a broad view of what is in the observer would have when looking for her or his own data in the database. In some foreknowledge of what is in the database, e.g., knowledge that a GBT that will allow anyone to download GBT data from the archive, but it may assume archive access tool

The NRAO is developing an or Enthought. companies like Anaconda systems from the Python web site or from Python, which is freely available for the Linux, Windows, and Mac operating GBT data, the tool we will use here is the programming language, avaliable from NRAO. Since this document is about some of the nitty-gritty of excellent data analysis tool called GBTIDL For scientists looking for the most direct path to publication there is an and data science can make use of this resource with basic programming tools. with an emphasis on how students, instructors, and enthusiasts of astronomy public use. This web page and its links give an overview of this data base data. After a propietary period of one year, these data are available for Bank radio telescope (GBT) has generated more than 35 terabytes of archival Since the beginning of full-scale operation in 2003, the 100-meter Green

Data Science and the Green Bank Telescope Data Archive

} column-count: 2; -moz-column-count: 2; /* Firefox */ -webkit-column-count: 2; /* Chrome, Safari, Opera */.newspaper {