INDEX
Explanations
adjectives related to quality or characteristics of things
New Auto-Interp
Negative Logits
irl
-0.71
jri
-0.69
ixtape
-0.68
Continent
-0.67
RN
-0.66
ramid
-0.65
xit
-0.63
gettable
-0.63
ulner
-0.62
agonist
-0.62
POSITIVE LOGITS
increments
1.13
proportions
1.03
fashion
0.97
circumstances
0.95
chronological
0.93
shape
0.92
circles
0.92
accordance
0.89
haste
0.89
terms
0.88
Activations Density 0.109%