INDEX
Explanations
phrases mentioning specific dates and events
occurrences of a specific character or symbol
Negative Logits
farmland      -0.73
polio         -0.71
wartime       -0.70
ammon         -0.70
unwanted      -0.69
resemblance   -0.69
capacity      -0.69
Mobil         -0.68
mushroom      -0.68
elusive       -0.68
Positive Logits
efe           1.13
tis           1.13
ï¸ı           1.09
cause         1.09
she           1.08
nor           1.04
sn            1.02
ski           0.93
STEM          0.92
ometimes      0.92
Activations Density 0.180%