INDEX
Explanations
phrases related to staying away or avoiding something
phrases related to avoidance or staying away from something
New Auto-Interp
Negative Logits
catentry
-0.78
elf
-0.74
roundup
-0.72
lot
-0.67
Navajo
-0.62
MS
-0.62
erity
-0.60
Surprise
-0.59
converter
-0.58
profiling
-0.57
POSITIVE LOGITS
indefinitely
0.85
lishes
0.82
confines
0.78
andestine
0.77
bounds
0.74
forever
0.73
haun
0.69
angered
0.68
arcer
0.67
intact
0.67
Activations Density 0.134%