Explanations
occurrences of the word "knowledge" and its variations
Negative Logits
isha      -0.16
acci      -0.15
atters    -0.15
pekt      -0.13
ALIGN     -0.13
\<^       -0.13
acades    -0.13
ork       -0.13
487       -0.13
anno      -0.13
Positive Logits
.microsoft   0.19
fully        0.17
senal        0.15
ifar         0.14
://%         0.14
crate        0.14
lenÃŃ        0.14
#af          0.14
owski        0.14
θο           0.14
Activations Density: 0.023%