Explanations
terms related to physical actions or events
references to actions involving physical confrontation or interaction
Negative Logits
SPONSORED     -0.79
SEA           -0.70
²¾            -0.68
ITNESS        -0.67
iterranean    -0.66
amia          -0.64
Teachers      -0.64
xia           -0.61
Dim           -0.61
âĶľ           -0.60
Positive Logits
onto          0.87
into          0.81
away          0.76
glass         0.75
itch          0.75
goodbye       0.74
slider        0.73
button        0.71
izont         0.71
onto          0.69
Activations Density 0.597%