INDEX
Explanations
discussions of admissions, confessions, and expressions of humility
Negative Logits
еро
-0.17
addons
-0.16
Closure
-0.15
peč
-0.15
بواسطة
-0.15
MBED
-0.15
ольз
-0.15
zí
-0.14
اله
-0.14
alez
-0.14
Positive Logits
admit
0.77
admission
0.73
admitting
0.72
admitted
0.71
admits
0.69
admissions
0.65
Admission
0.60
confess
0.53
acknowledge
0.50
confessed
0.50
Activations Density 0.279%