INDEX
Explanations
references to fleas
mentions of fleas or similar insects
New Auto-Interp
Negative Logits
LESS
-0.68
raints
-0.68
Demand
-0.67
DOWN
-0.64
Kislyak
-0.62
Mandarin
-0.62
Japanese
-0.62
ript
-0.62
Hussein
-0.61
raint
-0.61
POSITIVE LOGITS
fle
1.08
llo
1.01
uve
1.00
Fle
0.95
bats
0.85
Fle
0.83
fle
0.83
lla
0.80
mington
0.76
ll
0.74
Activations Density 0.005%