INDEX
Explanations
words related to professions, such as attorney, lawyer, investigator, and author
references to legal professionals and family relationships
Negative Logits
twitch            -0.67
"]=>              -0.64
rising            -0.64
attering          -0.64
avers             -0.64
verage            -0.62
uchin             -0.62
flush             -0.62
discrimination    -0.62
lems              -0.62
Positive Logits
Brendan      0.99
Andrew       0.99
Jamie        0.98
Nathaniel    0.97
Jonah        0.97
Ian          0.97
Roger        0.96
Linda        0.95
Patrick      0.95
Geoffrey     0.95
Activations Density: 0.202%