INDEX
Explanations
names of people or entities that are being recognized or awarded
the word "include" and its context within a list or examples
New Auto-Interp
Negative Logits
ritic
-0.79
enser
-0.74
oler
-0.73
istical
-0.71
kin
-0.71
reading
-0.68
rites
-0.68
bis
-0.67
alloween
-0.67
dule
-0.66
POSITIVE LOGITS
:-
0.91
*:
0.89
Ala
0.85
:
0.80
:[
0.77
:(
0.75
:#
0.73
Flo
0.73
Fernand
0.73
Rudolph
0.72
Activations Density 0.209%