INDEX
Explanations
phrases that indicate obtaining or acquiring something
New Auto-Interp
Negative Logits
AndGet
-0.19
Current
-0.17
meer
-0.16
uments
-0.15
unsch
-0.15
asa
-0.15
aire
-0.15
ammo
-0.14
icie
-0.14
sse
-0.14
POSITIVE LOGITS
rid
0.54
hold
0.31
chas
0.29
Rid
0.28
into
0.27
tings
0.27
rid
0.26
started
0.26
involved
0.24
aways
0.23
Activations Density 0.149%