INDEX
Explanations
names of researchers, editors, and authors in academic contexts
plural nouns or nouns with similar endings
New Auto-Interp
Negative Logits
exception
-0.62
exceptions
-0.59
Ocean
-0.58
fraction
-0.58
recharge
-0.56
exemptions
-0.56
comparable
-0.55
SIGN
-0.55
ASED
-0.55
arts
-0.55
POSITIVE LOGITS
ki
1.83
kaya
1.75
ky
1.72
mith
1.69
hire
1.57
nyder
1.47
hip
1.45
iewicz
1.36
haw
1.36
cu
1.35
Activations Density 0.181%