INDEX
Explanations
words related to different levels, degrees, or sizes of a specific quality or characteristic
phrases indicating varying degrees of importance or quality
New Auto-Interp
Negative Logits
idav
-0.69
etsk
-0.67
skirts
-0.62
iso
-0.62
pes
-0.61
Shut
-0.60
Alv
-0.59
pta
-0.59
MAP
-0.58
plets
-0.58
POSITIVE LOGITS
magnitude
1.11
importance
1.05
caliber
1.03
proportions
0.99
course
0.96
stature
0.95
renown
0.90
utmost
0.89
origin
0.87
calib
0.83
Activations Density 0.202%