INDEX
Explanations
references to the word "sob" and its variations, indicating themes of sadness or emotional distress
New Auto-Interp
Negative Logits
iverz
-0.18
ycin
-0.17
à¸Ĺà¸Ńà¸ĩ
-0.15
нимаÑĤÑĮ
-0.15
akis
-0.15
ãĥ³ãĥij
-0.15
idir
-0.14
erse
-0.14
ulla
-0.13
grese
-0.13
POSITIVE LOGITS
sob
0.19
ole
0.19
ri
0.18
Sob
0.18
ral
0.18
227
0.17
orno
0.17
orn
0.17
styl
0.16
rev
0.15
Activations Density 0.005%