INDEX
Explanations
phrases related to accountability and admission of responsibility
Negative Logits
Closure     -0.17
Closure     -0.15
بواسطة      -0.15
zí          -0.14
abus        -0.14
ycz         -0.14
FRING       -0.14
ольз        -0.14
ettel       -0.14
OVÁ         -0.14
Positive Logits
admission          0.84
admit              0.81
admitted           0.80
admitting          0.77
admits             0.73
admissions         0.73
Admission          0.69
acknowledge        0.59
acknowledgement    0.57
acknowledgment     0.56
Activations Density 0.339%