INDEX
Explanations
words related to physical injuries, such as sprains
references to sprains and related injuries
New Auto-Interp
Negative Logits
*/(
-0.90
Passage
-0.72
ocene
-0.70
uliffe
-0.69
Defenders
-0.67
uyomi
-0.67
EStream
-0.65
ogun
-0.65
galitarian
-0.65
Reviewer
-0.65
POSITIVE LOGITS
outed
0.98
outing
0.96
acer
0.94
spr
0.93
inters
0.87
atters
0.84
itely
0.81
ars
0.80
atche
0.80
ain
0.79
Activations Density 0.006%