Explanations
instances of discussion or storytelling related to personal or team experiences
after commas and periods
names of people and places
Negative Logits
Perſ  -0.91
Reſ  -0.85
Diſ  -0.85
Inſ  -0.80
Conſ  -0.80
Majefty  -0.77
Houſe  -0.76
Anſ  -0.76
ſche  -0.75
Perfon  -0.74
Positive Logits
Mc  0.68
Smith  0.67
Johnson  0.65
Williams  0.63
Jones  0.62
Mc  0.61
van  0.60
Singh  0.59
כה  0.58
Baillargeon  0.57
Activations Density 2.532%