INDEX
Explanations
phrases indicating compositions or makeup of groups or objects
phrases that denote composition or elements of a group
New Auto-Interp
Negative Logits
ilings
-0.76
WT
-0.68
hawks
-0.64
oos
-0.63
hound
-0.62
apon
-0.62
mob
-0.60
thur
-0.59
Sands
-0.58
Dull
-0.58
POSITIVE LOGITS
consist
0.87
solely
0.85
encies
0.83
chiefly
0.82
ertodd
0.79
itute
0.77
principally
0.76
mainly
0.75
ãĥĦ
0.74
alion
0.74
Activations Density 0.015%