inventory of tar archives on seedis.census.gov


summary of seedis disks
removal of empty and duplicate directories
tar file inventory (on seedis.census.gov)
checks for completeness
script files


summary of seedis disks (as of 2/27/97)

GB used from seedis directory listings
GB free from "show dev d"
GB other = 9.1 - GB used - GB free
GB total = 9.1 GB
directories pending from missing.txt
duplicate directories from dupfile.txt

							GB	GB	GB
disk			dirs*	files	blocks		used	free	other
dka200			182	 894	15201208	 7.8	 1.3	0.0
dka300			152	 568	14539713	 7.4	 1.5	0.2
dka400			  0	   0	       0	 0.0	 6.9	2.2
dka500			113	 552	15319736	 7.8	 1.2	0.1
dka600			150	 758	15214739	 7.8	 1.0	0.3
dka700			  0	   0	       0	 0.0	 7.3	1.8

total			597	2772	60275396	30.8	19.2	4.6

* dirs does not include [seedis]mss.dir
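
The derived columns can be reproduced with a short calculation. This is a minimal Python sketch, assuming 512-byte VMS disk blocks and decimal gigabytes (10^9 bytes), which appears consistent with the figures in the table. The function names are illustrative and are not part of the original scripts.

```python
BYTES_PER_BLOCK = 512      # VMS disk block size
DISK_CAPACITY_GB = 9.1     # "GB total" from the notes above


def gb_used(blocks):
    """Convert a block count to decimal GB, rounded to one place."""
    return round(blocks * BYTES_PER_BLOCK / 1e9, 1)


def gb_other(used, free, capacity=DISK_CAPACITY_GB):
    """GB other = GB total - GB used - GB free."""
    # Adding 0.0 turns a possible -0.0 from float rounding into 0.0.
    return round(capacity - used - free, 1) + 0.0


if __name__ == "__main__":
    # dka200 row: 15201208 blocks used, 1.3 GB free
    used = gb_used(15201208)
    print(used, gb_other(used, 1.3))
```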


tar file inventory (on seedis.census.gov)

VMS directory listings
dka200
dka300
dka500
dka600

tarfile summary statistics: summary.txt
tarfile inventory: tarfiles.txt
lstfile inventory: lstfiles.txt
logfile inventory: logfiles.txt
tarfiles missing from seedis: missing.txt
tarfile directory locations: dirloc.txt
duplicate directories: dupfile.txt


checks for completeness

tarinfo: lines with status 1 (selected) or 7 (completed) in tarinfo.txt, excluding most duplicate tapes and truncated tar files.

(861) tarfiles on MSS
(597) status 7 tarfiles in tarinfo.txt
(597) tarfiles in seedis
(597) primary and duplicate tapes

format	source	file		MSS	tarinfo	seedis

tar	gss	gss.tar		661	455	455	
tar	bck	all.tar		260	142	142
tar	total			861	597	597	

lst	gss	gss.red		686	na	410
lst	bck	bck.lst		225	na	102
lst	total			911	na	512

log	gss	gssa.log	828	na	381
log	bck	bcka.log	269	na	117
log	total		       1097	na	498


(597) subsets in tarfiles

disk	tarfiles	subsets
dka200	182		205289
dka300	152		145894
dka500	113		152796
dka600	150		 80994
total	597		584973
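
The total row can be checked against the per-disk rows. A minimal Python sketch, with the counts copied from the table above; the names `per_disk` and `column_totals` are illustrative only.

```python
# disk -> (tarfiles, subsets), copied from the table above
per_disk = {
    "dka200": (182, 205289),
    "dka300": (152, 145894),
    "dka500": (113, 152796),
    "dka600": (150, 80994),
}


def column_totals(rows):
    """Sum the tarfile and subset columns over all disks."""
    tarfiles = sum(t for t, _ in rows.values())
    subsets = sum(s for _, s in rows.values())
    return tarfiles, subsets


if __name__ == "__main__":
    print(column_totals(per_disk))  # (597, 584973)
```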


script files

VMS directory listings:

rlogin seedis.census.gov -l seedis
set def dka200:[seedis.mss]
dir/size/date/nohead/out=dka200.lst [...]
(repeat for disks dka300 through dka700)
logout

rlogin cedr.lbl.gov -l merrill
cd $MDOCS/census/tar2seedis/inventory
ftp seedis.census.gov
user seedis
(password)
cd dka200:[seedis.mss]
get dka200.lst
(repeat for disks dka300 through dka700)
quit

summary and file inventory:

rlogin cedr.lbl.gov -l merrill
cd $MDOCS/census/tar2seedis/inventory
inventory.pl summary.txt tarfiles.txt lstfiles.txt logfiles.txt
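
The source of inventory.pl is not shown here. As a rough illustration of the kind of summary it produces (dirs, files, blocks per disk), here is a Python sketch. It assumes each listing record is a single line holding the full file specification followed by the size in blocks and the date; long file names may wrap in real listings, which this sketch does not handle. Skipping the .DIR files is also an assumption.

```python
import re

# Assumed record layout (one line per file), for example:
#   DKA200:[SEEDIS.MSS.GSS001]GSS.TAR;1    83412  27-FEB-1997 10:15:22.31
LISTING_LINE = re.compile(
    r"^(?P<disk>[^:\s]+):\[(?P<dir>[^\]]+)\]"
    r"(?P<name>[^;\s]+);(?P<ver>\d+)\s+(?P<blocks>\d+)\b"
)


def summarize_listing(lines):
    """Count directories, files and blocks in a directory listing."""
    dirs = set()
    files = 0
    blocks = 0
    for line in lines:
        m = LISTING_LINE.match(line.strip())
        if m is None:
            continue  # blank or unrecognized line
        if m.group("name").upper().endswith(".DIR"):
            continue  # assumption: directory files are not data files
        dirs.add(m.group("dir").upper())
        files += 1
        blocks += int(m.group("blocks"))
    return {"dirs": len(dirs), "files": files, "blocks": blocks}
```

Usage would be along the lines of `summarize_listing(open("dka200.lst"))`, once per disk listing.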

tarfiles missing in seedis:

rlogin cedr.lbl.gov -l merrill
cd $MDOCS/census/tar2seedis
missing.pl > missing.txt
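
The comparison behind missing.txt amounts to a set difference between the tarfiles expected from tarinfo.txt and those found on seedis. A minimal Python sketch, assuming both inputs are plain lists of tarfile names; the actual formats read by missing.pl are not shown here, and the function name is illustrative.

```python
def missing_tarfiles(expected, present):
    """Return expected names that are not present, ignoring case."""
    present_set = {name.strip().lower() for name in present}
    expected_set = {name.strip().lower() for name in expected}
    return sorted(expected_set - present_set)


if __name__ == "__main__":
    # Hypothetical names, for illustration only
    print(missing_tarfiles(["gss001", "bck002", "gss003"],
                           ["GSS001", "gss003"]))
```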

tarfile directory locations:

rlogin cedr.lbl.gov -l merrill
cd $MDOCS/census/tar2seedis/inventory
dirloc.pl > dirloc.txt

duplicate directory locations:

rlogin cedr.lbl.gov -l merrill
cd $MDOCS/census/tar2seedis/inventory
dupfile.pl > dupfile.txt
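
A duplicate in this sense is a directory that appears on more than one disk. A minimal Python sketch of that test, assuming the input is a list of (disk, directory) pairs; this is not the logic of dupfile.pl itself, whose source is not shown, and the names used are hypothetical.

```python
from collections import defaultdict


def duplicate_directories(locations):
    """Map each directory found on more than one disk to its disks."""
    disks_by_dir = defaultdict(set)
    for disk, directory in locations:
        disks_by_dir[directory.lower()].add(disk.lower())
    return {
        directory: sorted(disks)
        for directory, disks in disks_by_dir.items()
        if len(disks) > 1
    }


if __name__ == "__main__":
    # Hypothetical locations, for illustration only
    print(duplicate_directories([
        ("dka200", "gss001"),
        ("dka700", "GSS001"),
        ("dka300", "bck002"),
    ]))
```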

remove duplicates

Duplicate directories were removed repeatedly until the files dirloc.txt and dupfile.txt were both empty.

move directories

35 directories (1.0 GB) in dka700:[seedis.mss...] were copied to dka200:[seedis.mss...] with 700to200.com. Then the files dka700:[seedis.mss.*]*.*.* and dka700:[seedis.mss]*.dir.* were deleted.

68 directories (3.5 GB) in dka400:[seedis.mss...] were copied to dka300:[seedis.mss...] with 400to300.com. Then the files dka400:[seedis.mss.*]*.*.* and dka400:[seedis.mss]*.dir.* were deleted.

count status 7 tarfiles in tarinfo.txt

rlogin parep2.lbl.gov -l merrill
cd $MDOCS/census/tar2seedis
grep '^7' tarinfo.txt > tarinfo7.txt

identify primary and duplicate tapes

rlogin parep2.lbl.gov -l merrill
cd $MDOCS/census/tar2seedis
duptape.csh
(makes duptape.txt from tarinfo.txt)

ftp seedis.census.gov
user seedis
cd dka400:[seedis.csa3.lsttape]
put duptape.txt
quit

check subsets in tarfiles

The following commands produced the log files dka*:[seedis.mss]checktar*.log, the lists of tarfile subsets dka*:[seedis.mss.*]*.lst, the counts of tarfile subsets dka*:[seedis.mss.*]*.lcnt, and the tarfile summary dka200:[seedis.mss]tarfiles.cod.

rlogin seedis.census.gov -l seedis

set def dka200:[seedis.mss]
submit/log=dka200:[seedis.mss]checktar200.log/cputime=infinite checktar200.com
type/out=dka200.lcnt [.*]*.lcnt.

set def dka300:[seedis.mss]
submit/log=dka300:[seedis.mss]checktar300.log/cputime=infinite checktar300.com
type/out=dka300.lcnt [.*]*.lcnt.

set def dka500:[seedis.mss]
submit/log=dka500:[seedis.mss]checktar500.log/cputime=infinite checktar500.com
type/out=dka500.lcnt [.*]*.lcnt.

set def dka600:[seedis.mss]
submit/log=dka600:[seedis.mss]checktar600.log/cputime=infinite checktar600.com
type/out=dka600.lcnt [.*]*.lcnt.

set def dka200:[seedis.mss]
copy/concat -
tarfiles.ddf,-
dka200:dka200.lcnt,-
dka300:dka300.lcnt,-
dka500:dka500.lcnt,-
dka600:dka600.lcnt -
tarfiles.cod

vi tarfiles.cod
(remove null lines)
(put period at end of each data line)
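
The two manual edits could also be scripted. A minimal Python sketch, assuming every non-blank line passed in is a data line; any header lines from tarfiles.ddf would have to be excluded by the caller. The function name is illustrative.

```python
def clean_cod_lines(lines):
    """Drop blank lines and end each remaining line with a period."""
    cleaned = []
    for line in lines:
        text = line.rstrip()
        if not text:
            continue  # remove null lines
        if not text.endswith("."):
            text += "."  # put period at end of the data line
        cleaned.append(text)
    return cleaned


if __name__ == "__main__":
    # Hypothetical data lines, for illustration only
    print(clean_cod_lines(["a 1", "", "b 2.", "   "]))
```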

ftp to $MDOCS/census/tar2seedis/tarfiles.cod
ftp to $MDOCS/census/tar2seedis/tarfiles.ddf




census/tar2seedis/inventory.html 9/18/97 in:

http://parep2.lbl.gov/mdocs
http://merrill.wwh.net/mdocs
http://imap.chesapeake.net/~merrill/mdocs

merrill@crocker.com