INDEX
Explanations
references to awards or recognitions in the context of literature and reading
words and phrases related to reading, books, and consuming media content.
New Auto-Interp
Negative Logits
ocities
-0.41
rinnov
-0.40
கூ
-0.39
fjspx
-0.37
zweif
-0.36
Schreib
-0.36
iguous
-0.36
rief
-0.35
testens
-0.35
Errorf
-0.35
POSITIVE LOGITS
movies
1.11
books
0.98
classics
0.96
films
0.94
novels
0.92
documentaries
0.77
movie
0.76
classical
0.74
movies
0.74
literature
0.73
Activations Density 0.438%