Explanations
references to family relationships and sports team details
Negative Logits
Joshua         -0.62
Amanda         -0.61
Jason          -0.59
Joshua         -0.59
Matthew        -0.59
Amanda         -0.57
amanda         -0.56
jason          -0.56
Jessica        -0.56
Christopher    -0.56
Positive Logits
Bob            1.14
Dick           1.11
Bob            1.04
Dick           0.98
Jim            0.95
Bill           0.95
Larry          0.86
Jerry          0.85
Jim            0.85
Bill           0.84
Activations Density 0.354%