INDEX
Explanations
words related to physical actions or tools
plural or gerund forms of nouns
Negative Logits
Token        Logit
mosqu        -0.68
citiz        -0.62
KNOWN        -0.61
newsp        -0.59
Niet         -0.59
carbohyd     -0.59
earthqu      -0.57
practition   -0.56
adolesc      -0.56
Citiz        -0.55
Positive Logits
Token        Logit
mith         1.15
creen        1.09
cale         1.09
etting       1.07
poons        1.03
ynthesis     1.02
hots         1.00
hare         0.99
heet         0.96
hift         0.96
Activations Density: 0.436%