INDEX
Explanations
phrases related to specific names, possibly related to locations or people
mentions of the Rockefeller and Morgenthau families
New Auto-Interp
Negative Logits
shapeshifter
-0.74
Buffy
-0.68
Plato
-0.66
Wonderland
-0.66
Syri
-0.65
netflix
-0.64
most
-0.64
fluorescent
-0.63
debtor
-0.61
Hasan
-0.61
POSITIVE LOGITS
helle
1.29
kefeller
1.11
atell
1.06
enthal
0.96
kef
0.95
hett
0.91
bons
0.89
afort
0.88
hester
0.87
Roc
0.83
Activations Density 0.015%