INDEX
Explanations
personal names, such as "Bush" and "Trump" along with related words
references to prominent political figures and their roles
New Auto-Interp
Negative Logits
Translation
-0.63
guiName
-0.62
âĶģ
-0.62
farious
-0.61
Ò
-0.61
igious
-0.60
actionDate
-0.59
soType
-0.58
%%%%
-0.58
Include
-0.57
POSITIVE LOGITS
cheated
1.10
stole
1.06
survived
1.03
forgot
1.01
loves
1.01
died
1.00
hates
1.00
blew
0.99
screwed
0.98
overcame
0.97
Activations Density 0.625%