INDEX
Explanations
phrases related to the needs and support of individuals
New Auto-Interp
Negative Logits
simp
-0.16
رة
-0.15
sim
-0.15
ilar
-0.15
igg
-0.14
atters
-0.14
urses
-0.14
оÑģÑĤан
-0.13
ulates
-0.13
GT
-0.13
POSITIVE LOGITS
msp
0.16
overy
0.15
agy
0.15
agate
0.14
paged
0.14
Began
0.14
ÑĤик
0.14
otland
0.14
thren
0.14
mapped
0.14
Activations Density 0.017%