INDEX
Explanations
years represented as four digits
instances of the year 2000
Negative Logits
atories      -0.72
warts        -0.71
senal        -0.69
venge        -0.67
Else         -0.67
artifacts    -0.66
aste         -0.66
ceiver       -0.64
atory        -0.63
akening      -0.63
Positive Logits
mAh          0.84
istar        0.76
visors       0.76
Hz           0.73
rpm          0.72
MHz          0.71
hz           0.71
MHz          0.69
kHz          0.68
å¹           0.68
Activations Density: 0.018%