INDEX
Explanations
the word "these" followed by a noun
references to a recurring subject or object within the text
Negative Logits
Token    Logit
achus    -0.74
ppe      -0.73
iness    -0.73
hood     -0.73
onis     -0.70
heit     -0.69
Da       -0.67
zzle     -0.67
john     -0.67
rix      -0.66
Positive Logits
Token           Logit
kinds           1.01
sorts           0.90
fellows         0.83
particular      0.81
proceedings     0.81
types           0.77
newfound        0.77
conduc          0.75
findings        0.75
developments    0.74
Activations Density: 0.083%