INDEX
Explanations
the word "thought" followed by a high confidence level
expressions of personal opinion or judgment about movies
New Auto-Interp
Negative Logits
kw
-0.74
conservancy
-0.69
ç«
-0.67
iona
-0.64
PLUS
-0.63
versions
-0.62
Naz
-0.61
acerb
-0.60
osures
-0.60
Footnote
-0.59
POSITIVE LOGITS
fully
1.00
lessly
0.94
fulness
0.89
ij士
0.72
Catalog
0.71
ileaks
0.67
Archdemon
0.66
ful
0.66
Parks
0.65
culus
0.64
Activations Density 0.054%