The G×E×M model fuses three specialized encoders through a novel trilinear attention mechanism, capturing pairwise and three-way interaction tensors between genomic, environmental, and management spaces.
Models epistatic interactions as a graph problem. Each SNP is a node; linkage disequilibrium thresholds define edges. Four stacked GAT layers with 8 attention heads each capture long-range genomic interactions across chromosomes.
A 6-layer Transformer ingests 365 daily climate vectors (T_max, T_min, precipitation, solar radiation, VPD, wind speed, humidity). The CLS token learns to aggregate seasonal patterns relevant to yield formation.
Encodes 12 agronomic management variables: nitrogen rate, planting density, irrigation amount and timing, tillage type, pesticide applications, cover crop, planting date, row spacing, and seed treatment.
The three embeddings g, e, m ∈ ℝ^256 are fused through a learned trilinear attention mechanism that captures all pairwise and three-way interaction tensors.
A_ge = softmax((g W_Q)(e W_K)ᵀ / √d)A_gm = softmax((g W_Q)(m W_K)ᵀ / √d)A_em = softmax((e W_Q)(m W_K)ᵀ / √d)z = FFN([g; e; m; A_ge⊗m; A_gm⊗e; A_em⊗g])ŷ = MLP(z) | σ² = MLP(z) via MC-DropoutL = MSE(ŷ, y) + λ KL[q(w)||p(w)]Evaluated on G2F Genomes-to-Fields maize dataset, 2014–2022, leave-environment-out cross-validation.