INDEX
Explanations
words related to textbooks
references to textbooks and educational materials
New Auto-Interp
Negative Logits
inion
-0.74
Vote
-0.69
ths
-0.65
inth
-0.64
oln
-0.63
arching
-0.63
pid
-0.62
ldon
-0.62
tail
-0.62
ounty
-0.62
POSITIVE LOGITS
textbook
1.38
textbooks
1.06
ãĥ¼ãĥĨ
0.97
é¾įå¥ij士
0.91
ļéĨĴ
0.86
ãĥ¼ãĥĨãĤ£
0.86
books
0.78
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
0.77
essors
0.76
Clicker
0.75
Activations Density 0.004%