INDEX
Explanations
phrases or words related to referencing or mentioning specific information
repeated references to mentioned concepts or topics within the text
Negative Logits
isol      -0.76
apo       -0.68
ffic      -0.66
PU        -0.65
capt      -0.64
bor       -0.64
iolet     -0.63
Clicker   -0.62
idity     -0.62
cession   -0.61
Positive Logits
herein       0.92
above        0.83
Parenthood   0.81
inconsist    0.78
by           0.76
hereafter    0.75
è£ıè         0.75
therein      0.74
supra        0.74
ãĤ´ãĥ³       0.71
Activations Density: 0.180%
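
Two of the positive-logit tokens (è£ıè and ãĤ´ãĥ³) look like raw byte-level BPE token strings rather than readable text. The page does not say which tokenizer the model uses, so the following is only a sketch under the assumption that it is a GPT-2-style byte-level tokenizer. The helper names `gpt2_byte_to_unicode` and `decode_token` are hypothetical and introduced here for illustration; the sketch rebuilds the standard byte-to-character table and inverts it to recover the underlying UTF-8 text.

```python
def gpt2_byte_to_unicode():
    """Rebuild the byte -> printable-character table used by GPT-2-style
    byte-level BPE (assumption: the dashboard's model uses this scheme)."""
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Remaining bytes (controls, space, etc.) are shifted to 256 and up.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in gpt2_byte_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map each displayed character back to its byte, then decode as UTF-8.
    Incomplete multi-byte sequences become U+FFFD instead of raising."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["herein", "è£ıè", "ãĤ´ãĥ³"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, ãĤ´ãĥ³ decodes to the katakana string ゴン, and è£ıè decodes to 裏 followed by a replacement character, because the token ends partway through a multi-byte character. Plain ASCII tokens such as herein pass through unchanged.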