INDEX
Explanations
explicit sexual content or references
New Auto-Interp
Negative Logits
endam
-0.57
TintMode
-0.53
ooga
-0.53
'*')
-0.50
reams
-0.49
"""",
-0.49
ammer
-0.49
ciclo
-0.48
listItem
-0.48
Advertise
-0.48
POSITIVE LOGITS
walk
0.96
walked
0.95
walks
0.88
grabbed
0.78
grab
0.76
WALK
0.76
walking
0.76
walk
0.75
enumii
0.69
Walk
0.69
Activations Density 0.359%