Explanations
phrases related to breaking or disruption
references to a specific television show or media franchise
Negative Logits
ser               -0.75
minist            -0.70
sal               -0.67
ient              -0.66
administrative    -0.65
ur                -0.64
potent            -0.64
latitude          -0.62
hed               -0.62
fearful           -0.62
Positive Logits
Break             3.81
Break             2.50
break             1.83
break             1.58
Breaker           1.47
breakdown         1.31
breaks            1.30
Broken            1.29
Breaking          1.29
Blow              1.25
Activations Density: 0.017%