INDEX
Explanations
mentions of the word "ola."
instances of the word "Lola"
New Auto-Interp
Negative Logits
ergy
-0.75
wcsstore
-0.74
frames
-0.71
picking
-0.67
ERAL
-0.66
rish
-0.66
sworth
-0.66
rings
-0.65
rant
-0.65
frame
-0.64
POSITIVE LOGITS
onga
0.94
iffe
0.93
ño
0.91
BILITY
0.91
isi
0.89
ppe
0.89
zzle
0.87
zzi
0.87
uthor
0.86
uga
0.83
Activations Density 0.010%