INDEX
Explanations
proper nouns or specific terms
references to significant events or objects, particularly in news or storytelling contexts
New Auto-Interp
Negative Logits
iencies
-0.77
respectively
-0.72
ertain
-0.65
ctors
-0.62
comply
-0.61
evenly
-0.60
enegger
-0.60
ptions
-0.60
iott
-0.59
aintain
-0.59
POSITIVE LOGITS
Called
0.73
Something
0.71
namely
0.70
calling
0.67
%:
0.67
Someone
0.67
HK
0.65
URA
0.64
possibly
0.64
çī
0.63
Activations Density 1.081%