INDEX
Explanations
phrases related to accountability and responsibility in various contexts
Negative Logits
Ì£          -0.17
izon        -0.17
veis        -0.16
صÙĪØ±        -0.16
θοÏĤ        -0.15
izar        -0.15
TOTYPE      -0.14
бин         -0.14
rades       -0.14
APPER       -0.14
Positive Logits
one             0.17
author          0.16
906             0.15
institutions    0.15
lic             0.15
ught            0.14
990             0.14
author          0.14
264             0.14
civilizations   0.14
Activations Density 0.274%
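
For readers unfamiliar with the density figure, the following is a minimal sketch of how such a number could be computed, assuming it means the fraction of sampled token positions on which the feature's activation is above zero. The function name, the threshold parameter, and the sample values are hypothetical illustrations, not the dashboard's actual implementation or data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    feature fires at all (activation > 0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical per-token activations: 3 active positions out of 1000.
sample = [0.0] * 997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.300%
```

Under this reading, a density of 0.274% would mean the feature is active on roughly 27 of every 10,000 sampled tokens.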