INDEX
Explanations
phrases that signal a change of topic or a new direction in the text
instances of the word "Well" in various contexts
New Auto-Interp
Negative Logits
omore
-0.65
illary
-0.61
adding
-0.60
ËĪ
-0.60
rift
-0.59
central
-0.59
aded
-0.59
acist
-0.58
kefeller
-0.58
Roaming
-0.57
POSITIVE LOGITS
yeah
0.95
hello
0.94
guess
0.93
prest
0.93
luckily
0.84
ington
0.83
esley
0.82
,
0.82
congratulations
0.79
done
0.78
Activations Density 0.032%