Explanations
references to things that are described as generic
instances of the word "generic."
Negative Logits
oir               -0.74
kers              -0.74
otos              -0.73
=-=-=-=-=-=-=-=-  -0.73
cano              -0.72
Jenn              -0.70
Ferry             -0.70
rey               -0.68
=-=-              -0.66
tein              -0.66
Positive Logits
generic           0.84
ization           0.81
isable            0.77
interchange       0.75
applic            0.74
ality             0.73
ALLY              0.71
generic           0.69
isation           0.69
ised              0.69
Activations Density 0.011%