INDEX
Explanations
names related to politics
references to a specific individual named Burr
New Auto-Interp
Negative Logits
Leone
-0.76
////////////////
-0.71
Tigers
-0.70
Editors
-0.67
gaard
-0.67
e
-0.67
porting
-0.65
////////////////////////////////
-0.64
Outlook
-0.64
jay
-0.64
POSITIVE LOGITS
Burr
1.18
agus
0.85
ĵĺ
0.84
ikuman
0.81
aughs
0.80
ensical
0.79
halla
0.79
acci
0.77
ategory
0.77
iceps
0.77
Activations Density 0.013%