INDEX
Explanations
mentions of specific names, likely proper nouns
names and terms related to specific individuals and styles, particularly in the context of architecture and pop culture
Negative Logits
concess     -0.79
icum        -0.74
provision   -0.73
inent       -0.71
stru        -0.70
etheus      -0.70
ocrin       -0.69
sole        -0.68
compr       -0.67
iden        -0.67
Positive Logits
glers       1.15
vernment    0.96
Pengu       0.84
irlfriend   0.81
ospels      0.80
IVERS       0.77
ORGE        0.77
ourmet      0.77
gets        0.76
FFER        0.75
Activations Density 0.059%