INDEX
Explanations
instances where encouragement or positive reinforcement is mentioned or implied
instances of the word "encourage" and its variations
New Auto-Interp
Negative Logits
Nanto
-0.74
entin
-0.66
abases
-0.65
sworth
-0.63
Antiqu
-0.62
Corpse
-0.61
gur
-0.61
angler
-0.60
thin
-0.60
minster
-0.59
POSITIVE LOGITS
imaru
0.83
encourage
0.77
Tradable
0.77
="#
0.77
GGGGGGGG
0.77
encourages
0.76
wcs
0.75
untarily
0.72
tale
0.71
discouraged
0.71
Activations Density 0.022%