INDEX
Explanations
references to a specific individual
references to a specific individual or character
New Auto-Interp
Negative Logits
Pastebin
-0.67
Worlds
-0.58
Amanda
-0.58
communities
-0.58
ĨĴ
-0.58
Lima
-0.58
Result
-0.56
AM
-0.56
neighborhoods
-0.56
Railroad
-0.56
POSITIVE LOGITS
personally
1.02
panic
0.97
ading
0.95
atically
0.86
zbollah
0.85
Majesty
0.84
enegger
0.80
atic
0.80
orally
0.79
eded
0.78
Activations Density 0.100%