INDEX
Explanations
instances of the word "almost"
New Auto-Interp
Negative Logits
fts
-0.16
alink
-0.16
Seasons
-0.15
orem
-0.15
claimer
-0.15
merely
-0.15
mere
-0.15
пÑĥ
-0.14
ãn
-0.14
242
-0.14
POSITIVE LOGITS
exclusively
0.24
entirely
0.21
certainly
0.20
every
0.19
immediately
0.19
always
0.18
imper
0.18
everything
0.18
identical
0.18
ready
0.17
Activations Density 0.033%