INDEX
Explanations
instances of the phrase "don't" in various contexts
New Auto-Interp
Negative Logits
Samar
-0.64
Tasman
-0.62
Axel
-0.62
BMC
-0.61
partName
-0.60
Samson
-0.59
NH
-0.58
RAF
-0.58
Override
-0.58
Lowell
-0.57
POSITIVE LOGITS
ember
0.91
ufact
0.79
iversary
0.77
udes
0.77
addafi
0.76
tarian
0.75
aughter
0.74
resent
0.73
imately
0.73
tal
0.73
Activations Density 0.248%