INDEX
Explanations
mentions of fictional elements or settings
mentions of fictional elements or settings within a narrative
New Auto-Interp
Negative Logits
ktop
-0.86
hammad
-0.82
gans
-0.75
cler
-0.74
ikk
-0.72
acks
-0.72
ni
-0.70
feeding
-0.70
xual
-0.69
trust
-0.69
POSITIVE LOGITS
istically
1.01
fictional
1.01
ãĤ¼ãĤ¦ãĤ¹
0.92
acters
0.91
portray
0.90
universes
0.89
recre
0.88
ized
0.88
portrayal
0.84
depictions
0.81
Activations Density 0.013%