INDEX
Explanations
names of people or entities
New Auto-Interp
Negative Logits
EMS
-0.79
yrinth
-0.72
romy
-0.72
iasm
-0.71
icult
-0.68
aunders
-0.68
iths
-0.68
istar
-0.67
psey
-0.65
nuts
-0.65
POSITIVE LOGITS
plates
1.44
plate
1.29
paces
1.07
redacted
0.94
names
0.94
tag
0.94
tags
0.93
ames
0.91
names
0.88
recognition
0.88
Activations Density 1.975%