INDEX
Explanations
the German word "befehl" or its variations
the presence of the substring "ef"
New Auto-Interp
Negative Logits
mileage
-0.67
scam
-0.66
Sammy
-0.66
Pathfinder
-0.65
humming
-0.62
Panama
-0.62
scams
-0.61
Columbia
-0.61
swinging
-0.61
Chapman
-0.61
POSITIVE LOGITS
ef
4.36
efer
2.03
efe
1.82
EF
1.54
eb
1.51
ec
1.45
ev
1.29
ep
1.27
eg
1.26
ek
1.26
Activations Density 0.010%