READ/WRITE SERVICE TECHNIQUES
Temporary jPermanent Errors
The primary cause
of
write data checks
is
the type caused by a loss in signal from the tape
on readback during write.
On the 3420 tape subsystem when tape
is
written the ampli-
tUde\s checked
to
ensure there
is
sufficient amplitude written on tape
to
be read at a
later date. This write checking level (threshold)
is
higher than the threshold at which the
tapes will eventually be read.
If
the amplitude
of
the read signal goes below the threshold
a temporary write error occurs and the error recovery program (ERP) takes control.
ERP backs the tape up to the last gap, then performs an erase
gap
before retrying the
write operation.
If
the write
is
then error-free one temporary write error
is
logged and
normal operation continues.
If
the retried write operation again fails another erase
gap
then write
is
performed. This
is
continued until the write
is
successful or until fourteen
tries to write the record have occurred.
In
that event one loop write-to-read command
is
tried.
If
successful, one more erase gap, write
is
attempted.
If
it again fails (fifteenth
time) a permanent write error
is
posted (the temporary write errors associated with this
failure are then discarded).
If
the write command
is
successful on, for instance, try number 10 EREP would show
nine erase gaps (actual number)
but
only one temporary write error. The reason for this
is
that write errors occurring while ERP has control are not logged
to
EREP while
all
erase
gaps
are.
Several causes
of
data check on tapes are:
1. Defects in oxide on tape
2. Dirty tape
3. Dirty tape path
4.
Edge
damage on tape
5.
Contamination on head
6. Tape device failures
When
excessive
e~rors
are encountered on a tape, corrective action should be taken. The
first action
would be to inspect the tape path for contamination.
If
any foreign material
is
present,
it
should be removed by using normal cleaning procedures.
If
error persists the
tape should be tried on a second tape device (if available) or a different, known good, tape
should be put
on
the failing tape unit. In this way the error condition can be isolated
to
the tape or
to
the tape unit.
When
a tape with large numbers
of
errors
is
found in a cus-
tomer library it should be identified and corrective action
by
customer should be taken.
This can include cleaning and recertification
of
the tape or removing the tape from the
tape library.
If
a tape shows signs
of
damage or contamination it should be removed be-
fore it can cause damage
to,
or cause contamination
to,
other tape in the customer's
library.
There are many other types
of
errors and causes
of
errors. The several covered in the
above text are the most common causes and are the majority
of
the types
of
errors the
CE
will encounter in the. field.
S3