INDEX
Explanations
dates or specific points in time mentioned in the text
repetitive references to "this" and "that" in the context of specific topics or occasions
Negative Logits
upon     -0.76
letes    -0.70
ocate    -0.69
encers   -0.69
atron    -0.68
ラン     -0.68
ults     -0.65
RIC      -0.64
inals    -0.63
LER      -0.61
Positive Logits
occasions   1.53
occasion    1.35
behalf      1.25
basis       1.14
shores      0.99
fronts      0.98
doorstep    0.96
occas       0.94
ilts        0.93
pretext     0.93
Activations Density 0.117%