INDEX
Explanations
instances of the word "avoid" along with the value 9 or 10, indicating an emphasis on caution or prevention
instances of the word "avoid."
New Auto-Interp
Negative Logits
iop
-0.90
geist
-0.69
essee
-0.68
Rated
-0.67
cart
-0.66
ART
-0.64
otle
-0.63
opter
-0.63
Ready
-0.62
Directorate
-0.62
POSITIVE LOGITS
detection
0.78
ably
0.75
pitfalls
0.74
wasting
0.72
ading
0.68
evade
0.67
avoid
0.67
avoidance
0.66
nels
0.66
azaki
0.65
Activations Density 0.026%