INDEX
Explanations
personalities or professions represented by proper nouns
titles and roles associated with various professions and expertise
New Auto-Interp
Negative Logits
etheless
-0.80
conclud
-0.77
compr
-0.75
atever
-0.72
arching
-0.68
âķIJâķIJ
-0.68
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ
-0.67
ilogy
-0.66
WARNING
-0.66
ש
-0.63
POSITIVE LOGITS
extraord
1.55
Mike
1.17
Joey
1.15
Jason
1.14
Kevin
1.13
Cody
1.11
Dave
1.11
Randy
1.10
Andy
1.10
Steve
1.10
Activations Density 0.230%