INDEX
Explanations
questions starting with "How do you" and "How do we" followed by verbs
query-based phrasing or questions regarding knowledge and understanding
New Auto-Interp
Negative Logits
pict
-0.66
letter
-0.62
iken
-0.61
tn
-0.61
Boat
-0.61
letters
-0.61
prints
-0.60
Runner
-0.60
court
-0.59
requisite
-0.59
POSITIVE LOGITS
reconcile
1.03
reconcil
1.01
cope
0.87
Ô
0.82
differentiate
0.81
distinguish
0.80
handle
0.75
justify
0.74
navigate
0.73
decipher
0.73
Activations Density 0.131%