INDEX
Explanations
proper nouns and specific terms or phrases within longer and diverse texts
references to names and notable events or actions
Negative Logits
fortun      -0.94
seiz        -0.89
ウス        -0.73
distingu    -0.71
sugg        -0.69
ifter       -0.68
jri         -0.65
advoc       -0.65
arrang      -0.64
compr       -0.63
Positive Logits
Banana      0.83
Raven       0.81
Nin         0.80
Kenyan      0.79
Raspberry   0.77
Nickel      0.77
Monte       0.75
HRC         0.75
Peach       0.75
Skydragon   0.75
Activations Density 0.692%
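
A minimal sketch of how a density figure like the one above could be computed, assuming it means the fraction of sampled tokens on which the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample values are all illustrative assumptions, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The dashboard itself does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, 7 of which activate the feature.
sample = [0.0] * 993 + [0.4, 1.2, 0.1, 2.5, 0.9, 0.3, 1.7]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.692% would mean the feature fires on roughly 7 of every 1000 sampled tokens, which fits a feature tied to specific proper nouns rather than common words.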