INDEX
Explanations
phrases related to feelings and the needs of individuals
Negative Logits
Dun       -0.15
ç½²       -0.15
Gamb      -0.14
unta      -0.14
lem       -0.14
onder     -0.14
letics    -0.14
ëŁ        -0.14
ipay      -0.14
levels    -0.14
Positive Logits
bell      0.17
/inet     0.16
aths      0.15
TURE      0.15
ç         0.15
table     0.14
acket     0.14
aida      0.14
layan     0.14
oyal      0.13
Activations Density 0.339%