Explanations
mentions of amphitheaters and other related terms
references to amphitheaters or similar performance venues
Negative Logits
Token        Logit
Benz         -0.75
Leilan       -0.72
lished       -0.68
penetrating  -0.66
enegger      -0.64
Origin       -0.64
Squirrel     -0.63
developing   -0.61
Annotations  -0.61
Prediction   -0.60
Positive Logits
Token        Logit
ithe         1.41
atre         1.35
selage       0.96
aviour       0.91
rette        0.90
aque         0.88
tesy         0.88
aton         0.87
reen         0.86
gregation    0.85
Activations Density: 0.009%