INDEX
Explanations
the word "broken" with varying degrees of severity
phrases that include the word "broken."
New Auto-Interp
Negative Logits
minist
-0.79
utterstock
-0.75
azor
-0.75
yss
-0.75
erva
-0.75
metics
-0.74
ickr
-0.73
idency
-0.73
itatively
-0.72
Reviewer
-0.72
POSITIVE LOGITS
neck
0.89
bones
0.86
broken
0.85
bones
0.79
broken
0.78
necks
0.75
fracture
0.73
Broken
0.71
broke
0.70
breaks
0.70
Activations Density 0.016%