Explanations
emotions and reactions related to surprise or disbelief
moments of surprise or realization
Negative Logits
Token          Logit
camp           -0.79
İĭ             -0.76
ngth           -0.75
conservancy    -0.74
hang           -0.70
vend           -0.69
oku            -0.68
ongo           -0.67
hunt           -0.65
hod            -0.62
Positive Logits
Token          Logit
seeing         1.09
hearing        1.09
witnessing     1.02
discovering    0.98
realizing      0.95
reading        0.93
heard          0.91
realised       0.90
Seeing         0.89
realization    0.89
Activations Density 0.211%