INDEX
Explanations
phrases describing progression, growth, or transformation
phrases indicating transformation or development processes
New Auto-Interp
Negative Logits
alty
-0.69
ounge
-0.64
ighed
-0.64
-0.62
ington
-0.61
nets
-0.61
leases
-0.61
Advice
-0.60
bid
-0.60
hair
-0.59
POSITIVE LOGITS
ãĥĩãĤ£
0.77
manageable
0.77
ELF
0.74
ãĤ©
0.74
livion
0.72
ç
0.71
usable
0.69
OGR
0.68
çİĭ
0.68
FK
0.67
Activations Density 0.439%