INDEX
Explanations
references to pain and physical experiences
observations and conditions
New Auto-Interp
Negative Logits
feeling
-0.87
Feel
-0.84
felt
-0.82
Feel
-0.81
feels
-0.81
FEEL
-0.80
feel
-0.79
Feeling
-0.78
feelings
-0.77
Feeling
-0.74
POSITIVE LOGITS
Tembelea
0.57
enfans
0.57
ब्रेकडाउन
0.54
drapeau
0.50
sayap
0.48
了嗎
0.48
ainfi
0.48
Tatsache
0.48
faſt
0.48
誒
0.47
Activations Density 0.220%