INDEX
Explanations
repeated mentions of specific personal names
names of individuals or characters
New Auto-Interp
Negative Logits
intent
-0.74
pmwiki
-0.74
ERAL
-0.66
Tyrann
-0.64
gio
-0.64
Aliens
-0.63
mbol
-0.63
egal
-0.61
ãĥķãĤ©
-0.61
tenance
-0.61
POSITIVE LOGITS
Owen
1.09
oken
0.89
Ree
0.85
shire
0.81
terness
0.79
oven
0.78
bridge
0.76
kefeller
0.76
Farrell
0.75
Owens
0.75
Activations Density 0.011%