Explanations
phrases related to the concept of understanding, specifically emphasizing the act of comprehending or grasping a situation or information
Negative Logits
onies       -0.80
hire        -0.71
boro        -0.69
\/\/        -0.69
onto        -0.67
nar         -0.67
gob         -0.66
inating     -0.65
drops       -0.64
iere        -0.64
Positive Logits
why             1.13
WHY             1.11
how             1.06
ably            0.91
why             0.91
comprehension   0.83
Understanding   0.82
whats           0.80
what            0.79
HOW             0.77
Activations Density 0.047%