INDEX
Explanations
phrases related to physical harm or injury
gerunds and participles in relation to actions or activities
Negative Logits
Present    -0.79
lance      -0.73
present    -0.72
κ          -0.68
draw       -0.68
ingen      -0.68
Jump       -0.66
writer     -0.66
staff      -0.66
lander     -0.65
Positive Logits
imentary   0.85
axy        0.81
gorith     0.79
azar       0.78
gorithm    0.76
ergic      0.73
enaries    0.70
ISTER      0.70
arine      0.68
abama      0.68
Activations Density 0.016%