INDEX
Explanations
themes related to the balance between individual actions and communal responsibilities
Negative Logits
ammen            -0.16
oshi             -0.16
sir              -0.16
Tide             -0.15
umen             -0.15
acob             -0.15
arrow            -0.14
баÑĩ             -0.14
okus             -0.14
ResponseStatus   -0.14
Positive Logits
Aaron            0.23
³                0.20
Aaron            0.17
camp             0.17
Lev              0.16
Bez              0.16
Nad              0.15
Mish             0.15
-tab             0.15
tab              0.15
Activations Density 0.027%