INDEX
Explanations
sentences that express health issues and physical discomfort
New Auto-Interp
Negative Logits
bigoplus
-0.54
inasmuch
-0.53
deoarece
-0.48
Moreover
-0.48
bigsqcup
-0.46
odotus
-0.45
ต่อไป
-0.42
poiché
-0.41
notably
-0.41
tably
-0.41
POSITIVE LOGITS
gotta
1.10
got
1.08
Been
1.06
gonna
1.05
Need
1.01
Got
1.01
Been
0.98
Gonna
0.98
need
0.95
Took
0.95
Activations Density 0.234%