Explanations
phrases that indicate emotional or existential reflection
Negative Logits
Morg      -0.20
ÐļиÑĶ      -0.17
Fresno    -0.15
ernes     -0.15
MG        -0.15
arası     -0.14
hive      -0.14
hood      -0.14
çıł       -0.14
nest      -0.14
Positive Logits
ham       1.33
Hamilton  1.31
Ham       1.31
Ham       1.21
ham       1.15
Hamilton  1.12
.ham      1.07
HAM       1.02
Hammer    0.94
hammer    0.88
Activations Density 0.044%