INDEX
Explanations
phrases related to confessions or admissions
instances of the word "admit" or its variations
New Auto-Interp
Negative Logits
prot
-0.66
????????
-0.66
bench
-0.65
grid
-0.62
map
-0.61
cycle
-0.61
shape
-0.61
ripple
-0.61
lance
-0.60
rend
-0.60
POSITIVE LOGITS
admitted
3.59
admits
2.30
confessed
2.29
admit
2.19
acknowledged
2.02
conceded
2.01
admitting
1.98
admission
1.95
admissions
1.65
concedes
1.53
Activations Density 0.008%