INDEX
Explanations
verbs in the past tense
occurrences of the word "learned."
New Auto-Interp
Negative Logits
adies
-0.74
oided
-0.73
adra
-0.72
ankind
-0.69
abwe
-0.69
pled
-0.65
elled
-0.65
stagger
-0.64
ataka
-0.64
etry
-0.63
POSITIVE LOGITS
Lear
0.95
llor
0.88
Teach
0.81
Learned
0.78
æ©
0.78
learn
0.77
ĵĺ
0.76
çīĪ
0.76
DragonMagazine
0.75
Learning
0.74
Activations Density 0.026%