INDEX
Explanations
mentions of discussions or topics to be discussed
instances of the word "discuss."
Negative Logits
Token       Logit
served      -0.74
peria       -0.74
eared       -0.73
robbed      -0.72
occupied    -0.71
ifter       -0.69
pes         -0.69
installed   -0.67
bott        -0.66
gged        -0.65
Positive Logits
Token       Logit
Discuss     1.03
discussing  0.96
discuss     0.93
Discuss     0.92
Topics      0.90
discusses   0.90
topics      0.81
ļéĨĴ        0.76
summarizes  0.74
discussed   0.74
Activations Density 0.014%
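
The density figure above can be read as the share of token positions on which the feature fires at all. That reading is an assumption about how the dashboard computes it, not something stated on this page. A minimal sketch under that assumption, with a hypothetical helper `activation_density` and made-up sample data:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens with a nonzero
    activation. The function name and threshold are illustrative only.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 14 of them active.
sample = [0.0] * 100_000
for i in range(14):
    sample[i * 7_000] = 1.5  # arbitrary positive activation value

density = activation_density(sample)
print(f"{density:.3%}")
```

At a density this low, the feature is silent on almost every token, which fits a feature tied to one word family such as "discuss".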