INDEX
Explanations
phrases related to theft and stealing
references to theft or stealing
New Auto-Interp
Negative Logits
adi
-0.73
Compar
-0.71
tu
-0.70
HUD
-0.69
INO
-0.67
avering
-0.66
ractor
-0.66
sted
-0.65
tions
-0.65
eele
-0.64
POSITIVE LOGITS
unsuspecting
0.81
stolen
0.78
priceless
0.71
purse
0.70
scraps
0.70
belongings
0.70
Rs
0.69
valuable
0.67
precious
0.67
penn
0.67
Activations Density 0.257%