INDEX
Explanations
names containing the sub-string "bro."
occurrences of the name "Bro" or similar variations of that name
New Auto-Interp
Negative Logits
ngth
-0.71
llah
-0.65
Pengu
-0.63
idates
-0.62
spoilers
-0.61
iosyncr
-0.61
showc
-0.60
Uriel
-0.60
oshi
-0.59
fines
-0.58
POSITIVE LOGITS
reau
0.83
ħĭ
0.79
necks
0.78
deals
0.75
trout
0.75
lyn
0.74
д
0.70
mire
0.69
swick
0.68
furt
0.67
Activations Density 0.106%