We present a novel, data-driven approach to predict systematic model errors in the ocean component of a coupled general circulation model leveraging deep learning and data assimilation. We examine the skill of the proposed scheme in learning systematic model errors, including their spatial patterns, variance, scales, and test its sensitivity to different predictors and neural network architecture. The scheme utilizes local state variables such as ocean temperature, salinity, velocities, and surface fluxes to predict corrections to temperature tendency for the upper 1000 meters in the ocean on daily timescales. The performance is evaluated on the withheld test dataset and compared against the empirical climatological temperature corrections that are geographically dependent. The performance is depth-dependent, with significant improvements over the benchmark in the upper 20 meters in the ocean. It degrades rapidly with depth but remains comparable to the climatology benchmark. Neural networks can capture up to 40-50% of the daily variance in temperature increments in the upper 20 meters relative to the benchmark’s 20%. The improvements are associated with networks predicting finer spatiotemporal scales than the benchmark. They are expected to perform better in reducing surface ocean mixed layer bias than previously used techniques. Despite being column-local without geographical inputs, networks can sufficiently reproduce spatial patterns on daily and longer timescales. The patterns consist of corrections to regional dynamical features such as western boundary currents, equatorial undercurrents, bathymetry-related corrections in the Southern Ocean, and warm surface increments over subtropical and midlatitude belts.