INDEX
Explanations
mentions of a specific person, likely named Adrian
occurrences of the name "Adrian" and variations of it
Negative Logits
Token     Logit
creen     -0.95
ipeg      -0.84
itude     -0.78
cham      -0.77
rooms     -0.71
atonin    -0.69
heet      -0.68
uge       -0.67
ening     -0.66
pard      -0.65
Positive Logits
Token     Logit
Peterson  0.79
asus      0.76
lia       0.70
nered     0.68
hoe       0.68
Ake       0.66
verse     0.66
amara     0.64
Amos      0.64
APH       0.63
Activations Density: 0.082%