INDEX
Explanations
numbers in the text
references to statistics or numerical totals
New Auto-Interp
Negative Logits
tera
-0.69
soType
-0.68
privile
-0.63
era
-0.63
hend
-0.62
Receiver
-0.62
riers
-0.60
Bomb
-0.59
Branch
-0.58
Quote
-0.58
POSITIVE LOGITS
sorts
0.90
eighty
0.73
oscopic
0.69
pounds
0.69
course
0.68
120
0.67
abama
0.66
sixty
0.66
seventy
0.66
eight
0.66
Activations Density 0.070%