INDEX
Explanations
phrases that indicate personal connections and experiences in a narrative context
New Auto-Interp
Negative Logits
âĹĦ
-0.15
Station
-0.14
ssp
-0.14
readers
-0.14
lingen
-0.14
narrator
-0.13
Styles
-0.13
uchs
-0.13
afs
-0.13
reader
-0.13
POSITIVE LOGITS
movie
0.54
film
0.52
movie
0.44
film
0.41
movies
0.40
films
0.38
pelÃŃcula
0.37
æĺłçĶ»
0.36
ÑĦилÑĮ
0.36
ÑĦÑĸлÑĮ
0.36
Activations Density 0.112%