The theoretical Examination demonstrates that EDIS displays lowered suboptimality as compared to exclusively using on line info or directly reusing offline info. EDIS is often a plug-in strategy and might be coupled with present strategies in offline-to-on line RL setting. By employing EDIS to off-the-shelf methods Cal-QL and IQL, we observe a nota