INDEX
Explanations
calls for action and collaboration in addressing issues
Negative Logits
fame        -0.66
obar        -0.66
abilia      -0.61
hung        -0.61
famous      -0.60
tabl        -0.59
fried       -0.59
amusement   -0.58
unct        -0.58
popped      -0.58
Positive Logits
legisl        1.03
responsibly   1.03
ASAP          0.96
ourselves     0.86
urgently      0.83
leadership    0.82
toward        0.82
defensively   0.81
tomorrow      0.80
offensively   0.80
Activations Density 0.191%