Explanations
expressions of hope and optimism
Negative Logits
Token      Logit
crites     -0.61
мәкал      -0.60
Claudius   -0.59
անդ        -0.59
wik        -0.56
xe         -0.55
body       -0.55
Sainte     -0.54
azah       -0.53
Bod        -0.52
Positive Logits
Token       Logit
hope        1.17
hopes       1.13
hopeful     1.13
Hoffnung    1.12
optimism    1.08
Hopes       1.08
HOPE        1.06
espoir      1.05
prospects   1.05
optimistic  1.05
Activations Density: 0.202%