INDEX
Explanations
mentions of people's professions or roles
the article "a" indicating instances of individuals, roles, or descriptions
New Auto-Interp
Negative Logits
views
-0.80
Edit
-0.75
querade
-0.75
alion
-0.75
mares
-0.74
iments
-0.73
attacks
-0.73
fn
-0.72
answer
-0.71
encies
-0.69
POSITIVE LOGITS
believer
1.00
fixture
0.98
bit
0.97
proud
0.96
descendant
0.95
tad
0.93
member
0.93
reluctant
0.90
participant
0.90
lifelong
0.88
Activations Density 0.242%