INDEX
Explanations
text related to political events and figures
New Auto-Interp
Negative Logits
scourge
-0.76
haul
-0.75
instinct
-0.72
hust
-0.71
bounded
-0.70
reven
-0.70
piping
-0.68
orche
-0.68
plet
-0.67
intended
-0.67
POSITIVE LOGITS
Lastly
1.77
Finally
1.64
Anyway
1.51
Nevertheless
1.48
Nonetheless
1.47
Meanwhile
1.46
Additionally
1.45
Eventually
1.44
Moreover
1.43
Regardless
1.43
Activations Density 1.847%