INDEX
Explanations
names or terms related to a specific individual or entity
mentions of specific names or terms related to individuals or entities
New Auto-Interp
Negative Logits
Hew
-0.72
Veronica
-0.67
Mess
-0.67
Commodore
-0.66
OCD
-0.66
Wonderland
-0.64
Visual
-0.62
Winning
-0.60
Kitty
-0.60
EN
-0.60
POSITIVE LOGITS
arb
4.74
arak
1.75
arbon
1.55
bard
1.44
adish
1.44
ARB
1.14
atel
1.09
aram
1.07
elta
1.04
kef
1.04
Activations Density 0.028%