INDEX
Explanations
phrases related to disagreements and negotiations
New Auto-Interp
Negative Logits
ngth
-0.92
iaries
-0.77
ionage
-0.77
OGR
-0.73
panic
-0.73
vik
-0.71
ROR
-0.69
IENCE
-0.66
qv
-0.66
ribune
-0.66
POSITIVE LOGITS
whether
1.23
semantics
0.97
merits
0.96
legality
0.93
whether
0.91
appropri
0.88
priorities
0.87
definitions
0.86
topics
0.86
specifics
0.85
Activations Density 0.169%