INDEX
Explanations
references to the concept of "it."
phrases related to thinking about or discussing something
New Auto-Interp
Negative Logits
steen
-0.69
rams
-0.64
Colo
-0.64
strom
-0.63
Towns
-0.63
Lar
-0.61
Greens
-0.60
Preston
-0.59
poons
-0.59
Pirates
-0.58
POSITIVE LOGITS
chy
1.01
alian
0.91
self
0.84
happening
0.82
atical
0.80
atic
0.80
beforehand
0.79
raining
0.75
happens
0.74
MpServer
0.74
Activations Density 0.102%