INDEX
Explanations
phrases related to avoiding or preventing something negative or undesirable
New Auto-Interp
Negative Logits
geist
-0.77
cart
-0.71
toc
-0.68
bleacher
-0.68
cow
-0.68
Must
-0.67
lease
-0.66
rooms
-0.65
ammy
-0.65
Led
-0.65
POSITIVE LOGITS
detection
1.26
pitfalls
0.97
collisions
0.86
wasting
0.86
relegation
0.84
accidents
0.82
regress
0.81
answering
0.79
hazards
0.79
fou
0.79
Activations Density 0.624%