INDEX
Explanations
expressions of personal feelings and emotions
New Auto-Interp
Negative Logits
的消息
-0.49
INSEE
-0.48
виправивши
-0.48
kaynağından
-0.47
ſche
-0.46
queſta
-0.46
houſe
-0.45
opsida
-0.44
⋙
-0.44
Houſe
-0.44
POSITIVE LOGITS
feel
0.65
feels
0.64
felt
0.60
FEEL
0.57
Feels
0.54
Feel
0.54
feeling
0.53
Feels
0.52
Feel
0.52
felt
0.49
Activations Density 0.021%