INDEX
Explanations
proper nouns
the occurrences of the name "John."
Negative Logits
liga      -0.83
pmwiki    -0.82
flies     -0.69
PDATE     -0.68
iture     -0.68
awaru     -0.67
compr     -0.67
Loading   -0.65
REP       -0.65
favour    -0.64
Positive Logits
athan     1.15
Doe       1.13
Hancock   1.00
ston      0.99
Cena      0.99
nie       0.94
Birch     0.90
Wiley     0.88
Hopkins   0.88
Podesta   0.86
Activations Density: 0.021%