INDEX
Explanations
instances of the word "know" and its variations, indicating knowledge or awareness
New Auto-Interp
Negative Logits
ç·Ĵ
-0.06
932
-0.06
обов
-0.06
Found
-0.06
oundary
-0.06
aub
-0.06
anto
-0.06
Boulevard
-0.06
Benef
-0.06
Formal
-0.06
POSITIVE LOGITS
enough
0.07
how
0.07
unb
0.07
POSITORY
0.07
hist
0.07
.bz
0.06
rer
0.06
background
0.06
bur
0.06
ac
0.06
Activations Density 0.022%