INDEX
Explanations
instances of the word "pass" in various contexts
Negative Logits
Drummond    -0.84
SMP         -0.73
Wul         -0.69
cenario     -0.65
implements  -0.64
Haram       -0.64
Tema        -0.64
Rowley      -0.63
arakhand    -0.63
cenas       -0.63
Positive Logits
PASS        1.60
Pass        1.59
Passes      1.54
passes      1.50
PASS        1.50
pass        1.49
Pass        1.48
pass        1.43
passing     1.39
passes      1.38
Activations Density: 0.065%