INDEX
Explanations
specific details or information mentioned in a text
occurrences of the word "details" or its variants
New Auto-Interp
Negative Logits
asus
-0.79
nder
-0.74
ONY
-0.70
ammers
-0.70
azar
-0.69
rade
-0.69
sw
-0.66
arte
-0.66
ony
-0.64
onz
-0.64
POSITIVE LOGITS
details
1.10
detail
0.95
displayText
0.91
glean
0.85
redacted
0.83
thereof
0.83
bourg
0.80
surrounding
0.79
TBA
0.76
Details
0.76
Activations Density 0.030%