INDEX
Explanations
instances of critical analysis related to cultural topics
New Auto-Interp
Negative Logits
رة
-0.15
ipel
-0.14
bai
-0.14
Commercial
-0.14
eprom
-0.14
atts
-0.14
commercial
-0.13
xaa
-0.13
elow
-0.13
Wong
-0.13
POSITIVE LOGITS
throughout
0.18
reader
0.17
chapter
0.16
chapters
0.16
readers
0.16
Reader
0.15
amber
0.15
ived
0.14
fault
0.14
chapter
0.14
Activations Density 0.071%