INDEX
Explanations
questions or statements expressing curiosity or doubt
questions expressing curiosity or inquiry about circumstances and outcomes
New Auto-Interp
Negative Logits
catentry
-0.83
20439
-0.77
mouth
-0.75
alysed
-0.74
arget
-0.69
ongs
-0.66
aration
-0.66
interstitial
-0.66
idation
-0.65
idelines
-0.65
POSITIVE LOGITS
xual
0.85
suspic
0.80
misunder
0.79
nostalg
0.76
why
0.72
fate
0.71
retribution
0.70
motives
0.70
millenn
0.70
explan
0.70
Activations Density 0.108%