INDEX
Explanations
expressions related to states of being and living situations
New Auto-Interp
Negative Logits
antu
-0.17
ghi
-0.17
elsewhere
-0.15
åºŃ
-0.15
inline
-0.14
inds
-0.14
acha
-0.14
ane
-0.14
akit
-0.14
OLT
-0.14
POSITIVE LOGITS
ÙħÙĨÙĩ
0.20
ÙģÙĬÙĩ
0.16
å±ŀ
0.16
upon
0.16
عÙĦÙĬÙĩا
0.16
å¤Ħ
0.15
è¦
0.15
udden
0.15
upon
0.15
icle
0.15
Activations Density 0.143%