INDEX
Explanations
expressions of feelings or emotional states
New Auto-Interp
Negative Logits
Zeneca
-0.84
Stow
-0.82
SpringRunner
-0.82
Astor
-0.81
Bradbury
-0.80
>−
-0.80
genomen
-0.78
martre
-0.77
verständlich
-0.76
Hawkes
-0.75
POSITIVE LOGITS
felt
1.77
feeling
1.73
feel
1.67
feels
1.65
Feel
1.61
Feel
1.61
Feels
1.58
Felt
1.57
Feels
1.56
Felt
1.54
Activations Density 0.059%