INDEX
Explanations
mentions of familiarity or lack thereof with certain topics or concepts
references to familiarity and unfamiliarity with concepts or subjects
New Auto-Interp
Negative Logits
Ħ¢
-0.69
tein
-0.65
avorite
-0.63
practicable
-0.63
ĺħ
-0.58
ument
-0.57
²¾
-0.57
©¶æ
-0.56
efficiency
-0.56
secondary
-0.56
POSITIVE LOGITS
izing
1.21
ized
1.19
ize
1.09
ising
1.06
ity
1.05
ities
1.02
ised
1.02
enough
1.00
izes
0.97
lly
0.95
Activations Density 0.032%