INDEX
Explanations
words related to education and specific organizations or entities like lecturers, universities, and the Postal Service
the end of sections or paragraphs in the text
New Auto-Interp
Negative Logits
ned
-0.77
ivari
-0.72
odka
-0.71
ague
-0.68
nor
-0.67
aban
-0.66
witz
-0.65
ivot
-0.65
lda
-0.65
itably
-0.65
POSITIVE LOGITS
ging
0.82
uring
0.79
gers
0.78
lishing
0.77
deen
0.74
ificantly
0.73
ilant
0.73
urous
0.73
sis
0.72
ures
0.70
Activations Density 0.135%