INDEX
Explanations
abstract concepts related to conflict or confrontation
New Auto-Interp
Negative Logits
constitu
-0.61
Places
-0.56
Seb
-0.54
Principles
-0.54
eger
-0.53
ĺħ
-0.53
cale
-0.52
IZE
-0.52
Pione
-0.50
Engineers
-0.50
POSITIVE LOGITS
away
1.06
ings
1.00
aways
0.95
offs
0.92
outs
0.91
athon
0.88
back
0.87
up
0.87
backs
0.85
out
0.83
Activations Density 9.171%