INDEX
Explanations
phrases referring to a specific position or state
phrases indicating various states of being or circumstances
New Auto-Interp
Negative Logits
Flavoring
-0.96
favorite
-0.68
origin
-0.66
Dresden
-0.63
Medal
-0.63
âĵĺ
-0.61
antid
-0.61
avorite
-0.60
glomer
-0.60
Brist
-0.59
POSITIVE LOGITS
WARE
0.72
Ĥİ
0.72
achy
0.70
haste
0.69
ossession
0.68
toile
0.65
nir
0.64
docker
0.63
hurry
0.63
phabet
0.61
Activations Density 0.044%