Explanations
words related to quarrels or disputes
Negative Logits
Token      Logit
627        -0.17
_extent    -0.15
ivo        -0.15
uts        -0.14
moment     -0.14
454        -0.14
ively      -0.14
imming     -0.14
Ori        -0.14
mans       -0.14
Positive Logits
Token      Logit
antine     0.30
rels       0.23
anta       0.21
ANTA       0.20
tern       0.20
antor      0.20
term       0.20
rell       0.19
rel        0.19
quar       0.18
Activations Density: 0.004%