INDEX
Explanations
phrases containing the word "count"
instances of the word "count" and its variations
New Auto-Interp
Negative Logits
backdrop
-0.63
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.61
lan
-0.60
olar
-0.60
obser
-0.59
agnetic
-0.58
obar
-0.57
folio
-0.56
apist
-0.56
mology
-0.56
POSITIVE LOGITS
enance
1.92
downs
1.02
ries
0.91
calories
0.87
rified
0.79
among
0.78
down
0.78
DOWN
0.72
towards
0.71
amongst
0.71
Activations Density 0.016%