INDEX
Explanations
the word "optimistic" or related terms
expressions of optimism
Negative Logits
Token          Logit
avis           -0.87
ĸļ             -0.80
ngth           -0.79
drivers        -0.77
deep           -0.74
hid            -0.74
artifacts      -0.74
avery          -0.73
Mamm           -0.72
conservancy    -0.70
Positive Logits
Token          Logit
optimistic     1.16
optimism       1.16
essim          1.14
upbeat         0.97
outlook        0.90
pessim         0.86
pessimistic    0.85
hopeful        0.85
llor           0.84
hope           0.79
Activations Density 0.020%
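
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes that "activation density" means the fraction of sampled tokens on which the feature's activation is above zero; the page itself does not define the term. The function name `activation_density`, the `threshold` parameter, and the toy activation array are all hypothetical and are not taken from the dashboard's data.

```python
import numpy as np


def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (tokens where the feature fires) / (all sampled tokens).
    """
    acts = np.asarray(activations, dtype=float)
    if acts.size == 0:
        raise ValueError("activations must not be empty")
    return float(np.count_nonzero(acts > threshold) / acts.size)


# Toy data (hypothetical): 10,000 sampled tokens, the feature fires on 2 of them.
acts = np.zeros(10_000)
acts[[17, 4242]] = [1.3, 0.4]

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.020%
```

Under this reading, a density of 0.020% corresponds to the feature firing on roughly 2 out of every 10,000 sampled tokens.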