INDEX
Explanations
topics or instances of discussion
occurrences of the word "discussion"
New Auto-Interp
Negative Logits
peria
-0.76
sole
-0.74
aches
-0.71
otto
-0.71
inker
-0.69
ells
-0.68
redit
-0.67
uilt
-0.67
alties
-0.66
inates
-0.66
POSITIVE LOGITS
discussion
1.18
Discussion
1.03
discussions
1.00
Topics
0.97
forum
0.94
Discuss
0.93
discussing
0.90
debates
0.88
atorium
0.85
debate
0.83
Activations Density 0.014%