INDEX
Explanations
emotional expressions and feelings conveyed through metaphoric language
New Auto-Interp
Negative Logits
Tanz
-0.72
Seym
-0.68
Container
-0.67
Commons
-0.67
Opportun
-0.66
Folk
-0.64
Monroe
-0.64
Lancaster
-0.64
RAF
-0.62
Samar
-0.62
POSITIVE LOGITS
ï¸ı
1.24
udes
0.95
agree
0.93
tal
0.92
ude
0.88
âĤ¬
0.84
ti
0.83
exist
0.82
tu
0.82
dro
0.82
Activations Density 0.328%