INDEX
Explanations
references to the name "Barack" in a text
repeated references to a specific individual, likely a politician
Negative Logits
Token         Logit
lihood        -0.94
EEE           -0.81
Marketable    -0.76
CRIP          -0.75
EngineDebug   -0.72
ーティ        -0.71
EVA           -0.69
ディ          -0.68
Tradable      -0.68
EY            -0.68
Positive Logits
Token         Logit
bara          1.32
celona        1.24
rier          0.99
becue         0.96
anca          0.96
aja           0.95
bar           0.93
riers         0.91
Bar           0.91
itone         0.91
Activations Density: 0.005%
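
Dashboards of this kind sometimes show tokens in raw byte-level form, so that a token such as ãĥ¼ãĥĨãĤ£ is really a multi-byte UTF-8 string. The sketch below assumes the tokens come from a GPT-2 style byte-level BPE vocabulary (the page itself does not name the tokenizer) and rebuilds that byte-to-character table to decode them. The function names are illustrative only and are not taken from the dashboard.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(raw):
    """Hypothetical helper: turn a raw byte-level token string into readable text."""
    data = bytes(UNICODE_TO_BYTE[ch] for ch in raw)
    # Token boundaries can split a character, so replace invalid sequences.
    return data.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for raw in ["ãĥ¼ãĥĨãĤ£", "ãĥĩãĤ£", "bara"]:
        print(repr(raw), "->", repr(decode_token(raw)))
```

Plain ASCII tokens such as bara pass through unchanged, so the helper can be applied to every row of a logit table. If the dashboard uses a different tokenizer, the mapping above does not apply.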