Explanations
prominent single words or short phrases that establish key ideas or themes within the text
Negative Logits
overall          -0.41
Overall          -0.32
overall          -0.32
formula          -0.31
bland            -0.31
rze              -0.31
Overall          -0.30
veo              -0.29
contentType      -0.28
Elba             -0.28
Positive Logits
excerpts         0.80
excerpt          0.79
extracts         0.79
Quoted           0.72
tagHelperRunner  0.71
extrait          0.71
quotes           0.69
AddTagHelper     0.68
Extracts         0.68
excerpt          0.68
Activations Density: 0.001%