INDEX
Explanations
medical conditions or incidents requiring serious attention or consequences
references to serious injuries or conditions
New Auto-Interp
Negative Logits
seamlessly
-0.77
Twain
-0.75
effortlessly
-0.75
Tycoon
-0.72
perfect
-0.70
Kinnikuman
-0.69
Favorite
-0.67
ramid
-0.67
creen
-0.66
perpetually
-0.66
POSITIVE LOGITS
injury
0.98
serious
0.96
harm
0.89
serious
0.87
injuries
0.87
offences
0.85
Injury
0.84
offenders
0.84
ptoms
0.83
consequences
0.82
Activations Density 0.022%