INDEX
Explanations
actions involving checking, examining, or evaluating something
phrases related to checking or observing situations
New Auto-Interp
Negative Logits
versive
-0.72
ÃŃa
-0.70
inqu
-0.66
theless
-0.65
onto
-0.63
prime
-0.62
whistle
-0.61
ãĥĨãĤ£
-0.61
onz
-0.61
ieri
-0.60
POSITIVE LOGITS
how
1.17
whether
1.02
firsthand
0.94
what
0.94
if
0.89
whats
0.87
how
0.83
whether
0.75
what
0.74
why
0.73
Activations Density 0.074%