INDEX
Explanations
elements related to movie reviews and production credits
NEGATIVE LOGITS
Token       Logit
ndo         -0.15
(strict     -0.15
ëıĻìķĪ      -0.14
Sas         -0.14
Pron        -0.14
'=>"        -0.14
tpl         -0.14
brittle     -0.14
idf         -0.13
anders      -0.13
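The token `ëıĻìķĪ` in the list above looks like mojibake, but it is consistent with how byte-level BPE vocabularies display non-ASCII tokens. The sketch below assumes the underlying tokenizer uses the GPT-2 style byte-to-unicode mapping; the source does not say which model or tokenizer produced this feature, so treat that as an assumption. `decode_token` is a hypothetical helper name, not part of any dashboard API. Under that assumption, the token decodes to the Korean word 동안.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table.

    Printable Latin-1 bytes map to themselves; every other byte is
    assigned a code point starting at U+0100, in increasing byte order.
    """
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Invert the table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a vocabulary string back into readable text (hypothetical helper).

    Assumes every character in `token` belongs to the byte-level alphabet;
    a character outside it raises KeyError.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


# The token from the negative-logit list, written with escapes so the
# exact code points are unambiguous.
token = "\u00eb\u0131\u013b\u00ec\u0137\u012a"
print(decode_token(token))
```

The same mapping explains why tokens with a leading space are often shown with a `Ġ` prefix in raw vocabularies: byte 0x20 is not printable on its own, so it is displayed as U+0120.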
POSITIVE LOGITS
Token       Logit
Fast        0.27
Fast        0.26
Furious     0.25
FAST        0.24
fast        0.24
fast        0.23
Vin         0.22
-fast       0.21
FAST        0.20
Diesel      0.20
Activations Density 0.015%