INDEX
Explanations
names and places related to political conflicts and agreements
New Auto-Interp
Negative Logits
rils
-0.31
aminer
-0.30
opus
-0.28
ikarp
-0.27
ails
-0.27
ammy
-0.26
oths
-0.26
rums
-0.26
Shards
-0.26
shards
-0.26
POSITIVE LOGITS
BIL
0.38
INGTON
0.29
BILITY
0.29
STAT
0.28
CLASSIFIED
0.28
BLE
0.28
DEN
0.28
CLOSE
0.28
gettable
0.27
ASS
0.27
Activations Density 0.029%