{"id":718,"date":"2013-07-25T18:24:46","date_gmt":"2013-07-25T18:24:46","guid":{"rendered":"https:\/\/mtlsites.mit.edu\/annual_reports\/2013\/?p=718"},"modified":"2013-08-13T20:24:13","modified_gmt":"2013-08-13T20:24:13","slug":"caplet-field-solver-accurate-real-time-capacitance-extraction-toolkit-using-instantiable-basis-functions","status":"publish","type":"post","link":"https:\/\/mtlsites.mit.edu\/annual_reports\/2013\/caplet-field-solver-accurate-real-time-capacitance-extraction-toolkit-using-instantiable-basis-functions\/","title":{"rendered":"CAPLET: Field-solver Accurate Real-time Capacitance Extraction Toolkit using Instantiable Basis Functions"},"content":{"rendered":"
Traditional capacitance extraction for VLSI interconnects usually adopts 2D scanning and table look-up methods for fast extraction at acceptable accuracy. However, for certain full-3D structures, such as partially overlapping wires and comb capacitors, 2D scanning methods fail to generate accurate results (within 5% errors). Therefore, using 3D field solvers becomes necessary despite the much slower performance. Some accelerated capacitance extraction field solvers have been proposed in the past decades, e.g., FASTCAP[1<\/a>]<\/sup> and pre-corrected FFT\u00a0[2<\/a>]<\/sup>, whose accelerations are effective only for large, semi-global structures of hundreds of wires. More importantly,[3<\/a>]<\/sup> and[4<\/a>]<\/sup> have demonstrated that the two acceleration methods are not efficiently parallelizable, showing rapid degradation of parallel efficiency with the number of cores (40% to 60% at eight cores).<\/p>\n This work targets the sub-second extraction performance within 5% errors for small-to-medium-sized structures of tens of wires. We adopt our instantiable basis functions (Figure 1(a)) developed in\u00a0[5<\/a>]<\/sup> as compact representations of charge distribution in a boundary element method. Such compactness both provides acceleration in serial execution (6x faster than FASTCAP for the NAND gate in Figure 1(b)), and achieves linear scalability of parallel efficiency (Figure 2). The key idea is that using such compact representations reduces the parallelization bottleneck, system solving computation, for from the original 90% of total computation to less than 5%. Hence the embarrassingly parallelizable part, filling the system matrix, is now dominant (from 10% of total computation to more than 95%)[6<\/a>]<\/sup>. In four-core parallel execution, we achieve a 38-ms extraction time for an inverter and 190-ms for a NAND gate. Our toolkit can directly handle GDS2 layouts to generate capacitance matrices and 3D visualization (Figure 1(b)). Our code is available in the public domain at http:\/\/www.rle.mit.edu\/cpg\/codes\/caplet\/index.html<\/a>.<\/p>\n\n\t\t