INDEX
Explanations
phrases related to being physically damaged or in a state of disrepair
instances of the word "broken."
Negative Logits
atur          -0.74
gee           -0.72
appell        -0.72
Salman        -0.71
Heller        -0.70
designate     -0.69
ffer          -0.67
advertising   -0.66
respond       -0.66
iens          -0.66

Positive Logits
broken         3.63
broken         2.28
Broken         2.10
shattered      2.05
fractured      1.95
cracked        1.75
busted         1.62
breaking       1.54
smashed        1.54
broke          1.51
Activations Density 0.013%
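
As a rough illustration of the density figure above, the sketch below assumes that activation density means the fraction of token positions where the feature's activation is greater than zero. The page itself does not state this definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce a value of the same size.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is (active positions) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 13 of which activate the feature.
sample = [0.0] * 99_987 + [1.5] * 13
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.013%"
```

Under that assumption, a density of 0.013% corresponds to roughly 13 active positions per 100,000 tokens, which fits a feature that fires only on "broken" and closely related words.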