Explanations
expressions referring to something considered standard or characteristic within a specific context
occurrences of the word "typical" and related variations
Negative Logits
aska      -0.71
acus      -0.71
nuts      -0.70
hani      -0.68
lands     -0.68
heed      -0.68
hung      -0.67
enced     -0.65
ashore    -0.64
lean      -0.62
Positive Logits
deviation     0.86
deviations    0.81
sized         0.81
fare          0.78
weekday       0.76
typ           0.76
ized          0.75
mammalian     0.75
attire        0.70
household     0.70
Activations Density: 0.035%