INDEX
Explanations
phrases indicating necessity or obligation
repeated uses of the phrase "has to be" in various contexts
Negative Logits
Globe    -0.68
Rab      -0.67
fray     -0.67
Hamm     -0.62
VICE     -0.62
Kore     -0.61
Jur      -0.60
ritz     -0.60
Kurd     -0.59
guyen    -0.59
Positive Logits
able         1.02
leeve        0.94
fitting      0.92
fits         0.91
accounted    0.90
replaced     0.88
done         0.86
dealt        0.85
understood   0.84
considered   0.84
Activations Density 0.061%