INDEX
Explanations
proper names and specific entities related to culture and medicine
key terms related to specific individuals or entities associated with a narrative
Negative Logits
Token          Logit
ired           -0.71
requ           -0.70
ools           -0.70
IRED           -0.68
RIC            -0.67
lot            -0.67
tin            -0.64
rier           -0.64
nic            -0.64
orne           -0.63
Positive Logits
Token          Logit
oka            1.04
uten           0.85
ĸļ             0.83
largeDownload  0.74
MV             0.69
ichi           0.69
stabil         0.68
Bore           0.66
âĺħâĺħ         0.66
gio            0.66
Activations Density: 0.012%