INDEX
Explanations
terms related to understanding and comprehension
New Auto-Interp
Negative Logits
番
-0.63
tığı
-0.62
pick
-0.59
zości
-0.58
BIBLIO
-0.56
drop
-0.55
fabs
-0.55
spyOn
-0.54
PICK
-0.54
ofire
-0.54
POSITIVE LOGITS
understand
3.48
understand
3.23
understanding
3.20
understands
3.17
Understand
3.17
understood
3.09
Understand
3.07
understanding
2.96
Understanding
2.76
understood
2.76
Activations Density 0.071%