INDEX
Explanations
words indicating capability or suitability, particularly those ending in 'able'
Negative Logits
ing       -0.08
ed        -0.08
ese       -0.08
ãĤ¥       -0.07
egal      -0.07
asso      -0.07
edb       -0.07
arily     -0.07
kowski    -0.07
el        -0.07
Positive Logits
heid      0.09
-bodied   0.09
Jar       0.07
uchar     0.07
ilty      0.07
yg        0.07
머ëĭĪ      0.07
ãĥ¼ãĥĹ    0.07
ipsis     0.06
Jar       0.06
Activations Density: 0.072%