INDEX
Explanations
occurrences of the word "Bab" with a varying activation level for each number
occurrences of the word "Bab" in various contexts
Negative Logits
gard          -0.65
REPORT        -0.63
mesh          -0.61
Ceres         -0.60
Copenhagen    -0.59
ICES          -0.58
âĶĢâĶĢ        -0.58
Frey          -0.58
Californ      -0.57
Fargo         -0.57
Positive Logits
cock          1.13
ylon          1.05
alon          0.97
raham         0.97
ush           0.97
yl            0.91
aji           0.91
amia          0.86
ies           0.86
lov           0.85
Activations Density 0.029%
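
The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of token positions at which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and not taken from the source:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.029% would mean the feature fires on roughly 3 of every 10,000 tokens, which fits a feature tied to one specific word fragment.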