Explanations
proper nouns or specific names
phrases related to the title and content of specific television series or films
Negative Logits
usercontent            -0.62
bases                  -0.58
ciplinary              -0.56
ĸļ                     -0.56
chnology               -0.54
soDeliveryDate         -0.53
probes                 -0.53
surveys                -0.53
channelAvailability    -0.52
alore                  -0.52
Positive Logits
âĻ¥                    0.57
uly                    0.56
Meditation             0.53
iva                    0.53
stic                   0.52
gp                     0.51
outp                   0.51
heit                   0.51
ili                    0.50
Ń·                     0.50
Activations Density: 0.889%