INDEX
Explanations
punctuation and numeric values within the text
New Auto-Interp
Negative Logits
phia
-0.20
opsy
-0.16
737
-0.16
swer
-0.16
iya
-0.16
kowski
-0.15
iber
-0.15
705
-0.15
CommandType
-0.14
oom
-0.14
POSITIVE LOGITS
Britt
0.16
resett
0.16
ì§Ģ
0.15
CP
0.15
ÅĽcie
0.15
Walk
0.15
Pell
0.15
ष
0.14
Walk
0.14
åŃ
0.14
Activations Density 0.028%