INDEX
Explanations
references to common injuries and health issues
New Auto-Interp
Negative Logits
ालय
-0.16
inson
-0.13
ικα
-0.13
ίÏĥ
-0.13
ůst
-0.13
harma
-0.12
isan
-0.12
osten
-0.12
तम
-0.12
]âĢı
-0.12
POSITIVE LOGITS
common
0.92
common
0.79
-common
0.71
Common
0.66
COMMON
0.66
commonly
0.65
Common
0.64
_common
0.63
.common
0.63
COMMON
0.62
Activations Density 0.287%