INDEX
Explanations
proper nouns or other specific words related to certain entities or events
line breaks or breaks in text formatting
Negative Logits
Token    Logit
WARE     -0.72
balls    -0.67
meal     -0.66
eers     -0.64
Sax      -0.63
catch    -0.62
tale     -0.61
Totem    -0.59
caught   -0.58
ãģŁ      -0.57
Positive Logits
Token    Logit
ackets   1.28
acket    1.14
anches   1.06
anch     1.04
igham    1.04
unn      1.01
aces     0.99
OAD      0.99
ushed    0.97
acing    0.95
Activations Density: 0.017%