INDEX
Explanations
references to coffee-related terms and activities within a specific setting
Neuron Alignment
Index    Value    % of L₁
1140     +0.15    0.6%
1194     +0.14    0.5%
486      +0.13    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1140     +0.15       0.03
1194     +0.14       0.02
486      +0.13       0.02
Negative Logits
accla        -0.97
shenan       -0.93
ridu         -0.91
sappi        -0.90
cushi        -0.88
scopri       -0.88
cammin       -0.88
purtroppo    -0.87
poichè       -0.86
vogli        -0.86
Positive Logits
coffee      1.57
coffee      1.40
Coffee      1.37
Coffee      1.29
COFFEE      1.07
coffees     1.05
FFEE        0.95
cafe        0.91
咖啡        0.90
caffeine    0.90
Activations Density 0.072%