INDEX
Explanations
updates and corrections in text
references to updates or reported information
New Auto-Interp
Negative Logits
adolescence
-0.77
entimes
-0.74
advant
-0.73
æ©
-0.71
ãĤ´
-0.70
ocious
-0.69
eval
-0.68
quartered
-0.68
worm
-0.67
glor
-0.67
POSITIVE LOGITS
*)
1.00
typo
0.91
clarification
0.90
FOIA
0.89
.):
0.82
PDATED
0.82
.)
0.82
UPDATE
0.81
pics
0.80
Update
0.80
Activations Density 0.386%