Explanations
words and phrases related to conflict or struggle
Negative Logits
agg    -0.16
alty   -0.16
ities  -0.16
heit   -0.15
upon   -0.15
appen  -0.15
icit   -0.14
aja    -0.14
cia    -0.14
ately  -0.14
Positive Logits
tooth    0.21
back     0.21
against  0.18
club     0.17
ทาน      0.17
牙       0.16
inh      0.16
Against  0.16
Tooth    0.15
ning     0.15
Activations Density 0.029%