INDEX
Explanations
phrases indicating the act of acquiring or obtaining something
Negative Logits
aire     -0.16
Current  -0.16
meer     -0.15
icie     -0.15
ials     -0.15
sse      -0.15
ammo     -0.15
ska      -0.14
atar     -0.14
amma     -0.14
Positive Logits
rid       0.55
hold      0.34
chas      0.31
Rid       0.29
tings     0.28
rid       0.28
started   0.27
into      0.25
involved  0.25
cha       0.24
Activations Density: 0.151%