INDEX
Explanations
instances of the word "discussion"
mentions of discussions on various topics
New Auto-Interp
Negative Logits
peria
-0.77
ensions
-0.75
uilt
-0.71
otto
-0.71
prints
-0.68
printed
-0.68
otted
-0.67
ussy
-0.67
clad
-0.66
elled
-0.65
POSITIVE LOGITS
forum
0.97
forums
0.93
ļéĨĴ
0.91
discussion
0.91
atorium
0.81
exchanges
0.79
discussions
0.79
topics
0.79
ij士
0.78
naire
0.77
Activations Density 0.020%