INDEX
Explanations
references to academic or career stages, particularly focusing on the terms "senior" and "freshman."
Negative Logits
"          -0.70
“          -0.68
(blank)    -0.67
(blank)    -0.65
(          -0.59
A          -0.56
$          -0.52
I          -0.52
↵↵         -0.51
p          -0.51
Positive Logits
poffible   1.10
myſelf     1.07
Monfieur   1.07
itſelf     1.06
againſt    1.06
Anſ        1.04
Jefus      1.04
Theſe      1.03
whoſe      1.03
Majefty    1.02
Activations Density: 0.208%