INDEX
Explanations
instances of specific words like 'beth', 'eth', 'oven', 'aul', and 'ier'
mentions of entities or names, particularly those related to classical composers
Negative Logits
slot          -0.70
tremend       -0.70
border        -0.66
Malays        -0.62
propensity    -0.62
detail        -0.60
ashtra        -0.59
frontier      -0.59
easing        -0.56
resultant     -0.56
Positive Logits
zeb           0.98
lehem         0.88
leck          0.82
mingham       0.80
iful          0.79
reau          0.76
oven          0.75
nown          0.73
fleet         0.72
onge          0.71
Activations Density 0.054%