Deep learning enhances earthquake monitoring capabilities by mining seismic waveforms directly. However, current neural networks, trained within specific areas, face challenges in generalizing to diverse regions. Here, we employ a data recombination method to create generalized earthquakes occurring at any location with arbitrary station distributions for neural network training. The trained models can then be applied to various regions with different monitoring setups for earthquake detection and parameter evaluation from continuous seismic waveform streams. This allows real-time Earthquake Early Warning (EEW) to be initiated at the very early stages of an occurring earthquake. When applied to substantial earthquake sequences across Japan and California (US), our models reliably report earthquake locations and magnitudes within 4 seconds after the first triggered station, with mean errors of 2.6-6.3 km and 0.05-0.17, respectively. These generalized neural networks facilitate global applications of realtime EEW, eliminating complex empirical configurations typically required by traditional methods.Earthquake monitoring is a primary task in seismology, and reporting earthquake parameters in real time has long been a critical effort for earthquake early warning (EEW) [1][2][3][4] . The current EEW systems typically require a few seconds to 1 min after an earthquake occurs to issue warning information to public 3 . The systems commonly consist of several modules, including data processing, source parameter evaluation, and alert filtering, used to report earthquake alarms based on continuous waveform streams [5][6][7] . The alarm is triggered if specific conditions are met, such as distinguishing teleseismic events, triggering a certain number of stations, reaching a certain magnitude, and meeting a percentage threshold of triggered stations (e.g., 40%) 5,7 .Complex empirical threshold value settings are involved in each processing step, making it a challenge to define the optimal alert criteria in EEW systems. Implementing overly strict criteria, such as requiring too many or a large number of stations to trigger, can negatively impact the real-time efficiency of EEW systems, while loose criteria can result in false alarms 5 .Theoretically, earthquake parameter determination may require data from at least four triggered stations to ensure accuracy [5][6][7] . The magnitude determination often requires 3 s P arrivals for a single station [8][9] . Therefore, the time delay for issuing a warning is the duration from the origin time to 3 seconds after the last station triggers when multiple stations are used to estimate the magnitude. However, in real applications, the time delay for issuing a warning may be even longer due to malfunctions in stations or system delays 6,10 . Therefore, an efficient real-time monitoring algorithm should not only be computationally fast in a concise way without extensive empirical configurations but also be able to solve the earthquake parameters with limited data available from the trigg...