To advance Polar code design for 6G functions, we develop a reinforcement learning-based common sequence design framework that’s extensible and adaptable to numerous channel situations and decoding methods. Crucially, our technique scales to code lengths as much as 2048, making it appropriate to be used in standardization. Throughout all configurations supported in 5G, our method achieves aggressive efficiency relative to the NR sequence adopted in 5G and yields as much as a 0.2 dB achieve over the beta-expansion baseline at . We additional spotlight the important thing components that enabled studying at scale: (i) incorporation of bodily legislation constrained studying grounded within the common partial order property of Polar codes, (ii) exploitation of the weak long run affect of choices to restrict lookahead analysis, and (iii) joint multi-configuration optimization to extend studying effectivity.

