INDEX
Explanations
references to specific individuals' names, especially "Miranda" and "Amar," indicating it detects mentions of particular people in the text
New Auto-Interp
Negative Logits
UnusedPrivate
-0.71
WebVitals
-0.70
balleur
-0.69
nakalista
-0.66
adaptiveStyles
-0.66
complexType
-0.64
beginnetje
-0.63
rawDesc
-0.63
Chbosky
-0.63
opheles
-0.63
POSITIVE LOGITS
Amar
0.94
igs
0.88
Amar
0.88
amar
0.86
Miranda
0.83
imation
0.75
0.73
amar
0.70
Miranda
0.68
Swanson
0.68
Activations Density 0.044%