INDEX
Explanations
words related to breaking or destruction
phrases related to the concept of "breaking" or significant disruptions
New Auto-Interp
Negative Logits
uther
-0.80
GY
-0.75
ulp
-0.72
minist
-0.69
nery
-0.67
ammy
-0.66
metics
-0.66
igmat
-0.64
iquid
-0.63
ality
-0.63
POSITIVE LOGITS
breakers
0.95
breaking
0.94
break
0.88
broke
0.85
breaking
0.83
break
0.80
breaks
0.74
breaks
0.73
necks
0.73
lyn
0.70
Activations Density 0.011%