INDEX
Explanations
expressions of personal feelings and emotions
Negative Logits
houſe         -0.75
Houſe         -0.68
balleur       -0.68
majánló       -0.65
+#+#          -0.63
estadounid    -0.63
linawan       -0.61
ésultats      -0.60
exitRule      -0.59
avoient       -0.59
Positive Logits
feel          0.88
feels         0.85
Feel          0.73
felt          0.73
feeling       0.70
feel          0.68
FEEL          0.67
feels         0.65
Feels         0.65
Feel          0.65
Activations Density 0.204%