INDEX
Explanations
verbs indicating an action being forcefully halted or prevented
expressions related to the concept of stopping or not stopping
Negative Logits
ever        -0.76
ighth       -0.76
女          -0.75
axter       -0.71
eah         -0.69
yl          -0.68
iosyncr     -0.68
iquette     -0.68
erd         -0.67
ety         -0.67
Positive Logits
Cooke       0.72
scrolling   0.67
flowing     0.63
Curt        0.62
grinning    0.62
spew        0.61
Despair     0.59
bragging    0.59
ticking     0.58
bothering   0.58
Activations Density: 0.071%