Explanations
mentions of specific items or concepts like names or terms indicated by "of"
instances of blank or empty sections in the text
Negative Logits
respective    -0.61
commute       -0.59
submar        -0.59
misunder      -0.58
respons       -0.58
iste          -0.57
campaigned    -0.57
accompan      -0.56
reperto       -0.56
visitor       -0.56
Positive Logits
course        0.98
course        0.95
Course        0.92
icial         0.84
Contents      0.84
sorts         0.76
Interest      0.74
Devices       0.72
aughs         0.71
Horror        0.71
Activations Density: 0.051%