INDEX
Explanations
mentions of the word "Oxford."
New Auto-Interp
Negative Logits
extAlignment
-0.47
gefü
-0.44
Mifflin
-0.43
SpringRunner
-0.40
geschlagen
-0.39
adpleegd
-0.39
MessageTagHelper
-0.38
Prairie
-0.38
Cardigan
-0.38
Luzon
-0.37
POSITIVE LOGITS
Barcelona
1.08
Milan
1.03
Oxford
0.98
Swiss
0.97
Barcelona
0.94
Liverpool
0.94
Milan
0.91
Switzerland
0.91
Oxford
0.88
Geneva
0.88
Activations Density 0.136%