Explanations
contractions in sentences
phrases indicating certainty or obligation
Negative Logits
atl                               -0.74
Hass                              -0.69
rawdownloadcloneembedreportprint  -0.65
tsky                              -0.63
Collider                          -0.60
Malays                            -0.59
OTOS                              -0.58
kin                               -0.57
Clause                            -0.56
hijab                             -0.56
Positive Logits
likely         1.07
automatically  0.96
surely         0.93
probably       0.91
invariably     0.89
certainly      0.87
inevitably     0.87
immediately    0.85
usually        0.84
undoubtedly    0.84
Activations Density: 0.352%