INDEX
Explanations
the term "well" and its variations throughout the text
New Auto-Interp
Negative Logits
Schaefer
-0.78
ობ
-0.70
ᾶ
-0.67
icksburg
-0.67
cifix
-0.66
dci
-0.64
TableBody
-0.63
Titans
-0.63
Brü
-0.61
Giordano
-0.61
POSITIVE LOGITS
well
2.93
WELL
2.50
well
2.48
Well
2.48
Well
2.45
WELL
2.13
wells
1.98
wells
1.69
Wells
1.57
bien
1.53
Activations Density 0.076%