INDEX
Explanations
statistical or factual information
recurring references to factual statements or information
New Auto-Interp
Negative Logits
avorite
-0.83
Klux
-0.81
artney
-0.75
isoft
-0.72
JV
-0.71
jri
-0.69
Carbuncle
-0.68
gewater
-0.68
charcoal
-0.67
hod
-0.67
POSITIVE LOGITS
ually
1.29
orial
1.18
ional
1.15
itious
1.08
oids
1.03
ual
1.02
ially
0.95
oid
0.95
uality
0.91
icity
0.89
Activations Density 0.028%