INDEX
Explanations
fragments related to publication or conclusion of something, like a book or a study
statements exhibiting criticism or skepticism about social behaviors or observations
Negative Logits
âĢ      -1.04
âĢ      -0.87
âĿ      -0.85
âĸº     -0.84
âľĶ     -0.84
ðŁ      -0.83
ðŁij    -0.81
¨       -0.79
        -0.77
âľ      -0.75
Positive Logits
hindsight   0.72
Canaver     0.72
later       0.71
pity        0.70
1956        0.69
1953        0.69
amused      0.69
1935        0.69
Kubrick     0.68
1936        0.68
Activations Density: 1.106%