Downloaded by guest on October 1, 2021 eplearning deep industry. informal in compliance regulatory of implications extent and the understand better to combined be and can observation govern- learning Earth machine the how in demonstrates approach regulations Our local data. to ment inaccuracies respect document with bias We potential direction. and unfavorable 2.5 an than from less blows diameter a (illegally) 20.4 of are to matter kilns up of contribute kilns 9% we Finally, how and respectively. facilities, show 77% live health and that schools people) and of km million kiln 1 within a (>18 of km least of 1 at within population that the show we of kiln estimates, brick 12% these every of Using coordinates Bangladesh. GPS across precise pre- the 88.7% extracts and accuracy and Bangladesh. 94.2% cision from with kilns imagery identifies approach satellite Our locating high-resolution for in approach set- kilns scalable low-income brick low-cost, a in is demonstrate challenge but We common tings. pollution to regulate—a of difficult and are source who monitor producers major informal a small-scale, by is dominated manufacturing South brick In populations. 2020) Asia, healthy 8, September and review environments for clean (received promoting 2021 for critical 3, is regulations February environmental approved with compliance and Improving Norway, Oslo, Oslo, of University Underdal, Arild by Edited 94305 CA Stanford, University, Stanford Health, Global in Innovation 94305; CA Stanford, University, Stanford 94305; CA Stanford, University, a Biswas Debashish Lee aid Jihyeon and kilns capacity brick regulatory identify to learning deep Scalable naeaeb .7yprpro u oarpluin hc is Bangladesh, which In PNAS (15). pollution, world air the to in due country person other per any y reduced than is 1.87 more Bangladesh by in average expectancy on life that annual suggest for responsible estimates total is Bangladesh’s and 12) of (11, 17% operate kilns when the of months 50% winter to of 30 manufactur- driver for brick responsible major dis- is that a ing significant estimate is Models cause which (10–13). carbon, and change black climate lungs and the mortality, inside and ease deep inhaled 2.5 be than less can including diameter a atmosphere, of of the matter number into particulate a envi- gases emit greenhouse substantial kilns and with Traditional pollutants harm. associated health is and industry ronmental this foremost, First compliance. and environmental for for approaches resources low-cost driven, because and observe (9). to limited often hard activ- are is enforcement and regulatory firms location these the of because ity both exacerbated these coun- pollution, are substantial developing problems for and In tanneries, responsible kilns, are observed. brick mines as small-scale readily such industries is informal where firms tries of where countries location developed litera- highly the this in applications of on most focuses However, ture (5–8). and agencies regulations regulatory environmental monitoring, support of of costs compliance the and reduce enforcement, can (1–4). obser- learning Earth machine violators in and sanction advances vation how or demonstrates punish literature growing effectively A to ability the and E Bangladesh optrSineDprmn,Safr nvriy tnod A94305; CA Stanford, University, Stanford Department, Science Computer rc auatrn sa xeln aet eeo data- develop to case excellent an is manufacturing Brick euaoyaece fe akifraino compliance on information lack difficult. often notoriously agencies is Regulatory regulations environmental nforcing 01Vl 1 o 7e2018863118 17 No. 118 Vol. 2021 | i pollution air | a,1 aelt imagery satellite iaR Brooks R. Nina , e n tpe .Luby P. Stephen and , c od nttt o h niomn,Safr nvriy tnod A94305; CA Stanford, University, Stanford Environment, the for Institute Woods | niomna regulations environmental b,1,2 e µµ CO nentoa etefrDaroa ies eerh agaeh hk 22 agaeh and Bangladesh; 1212, Bangladesh, Research, Disease Diarrhoeal for Centre International )i hk hntewind the when Dhaka in m) µµ PM g/m 2 ai Tajwar Fahim , msin 1) Recent (14). emissions 2.5 3 fPM of nDaadrn the during Dhaka in (PM µm c,f 2.5 (particulate | 2.5 ,which ), a asalBurke Marshall , b metItricpiayPormi niomn n eore,Stanford Resources, and Environment in Program Interdisciplinary Emmett doi:10.1073/pnas.2018863118/-/DCSupplemental at online information supporting contains article This 2 1 ulse pi 2 2021. 22, April Published BY-NC-ND) (CC 4.0 NoDerivatives License distributed under is article access a open This Submission.y AtlasAI, Direct economic PNAS of a and is cofounders article quality This infrastructure are measure world.y S.E. developing to the learning and in outcomes machine D.B.L., uses D.B.L., M.B., that S.E., company M.B., statement: F.T., N.R.B., interest J.L., in Competing and used data; paper. data y the J.L., analyzed primary wrote F.T. S.P.L. research; contributed and and D.B., designed D.B. N.R.B., S.P.L. and J.L., and N.R.B. analysis; research; D.B.L., the performed S.E., F.T. M.B., and N.R.B., N.R.B., J.L., contributions: Author satellite sur- in visible and them large, (see making bricks, images reasonably drying of distinct, rows visually (16–18). by rounded very kilns are coal-fired kilns inefficient, pro- Brick highly are bricks traditional, Asia, in South duced across and Bangladesh, In approach. past the a over As rapidly 17). 18). (17, proliferated (16, decades have construction few kilns to traditional central mate- is result, construction other production many brick of rials, sources domestic lacks which infrtegvrmn.A nomlklsaecntnl being constantly are kilns informal As government. the informa- kilns this for obtaining brick tion of of importance labor-intensive the locations and underscores time- that the effort incredibly verify (20)—an and country launched the map recently across manually (GoB) to Bangladesh study (16, of a minimal formal Government is The this kilns 19). and eschew unregistered against government many enforcement the but process; with clearance, in register environmental kiln a must obtain establish owner legally an To bet- target decisions. Bangladesh, support regulatory society and and risk, civil policy at or ter are agencies who populations regulatory identify help violations, can located are owo orsodnemyb drse.Eal [email protected] Email: addressed. be may correspondence whom work.y To this to equally contributed N.R.B. and J.L. eeaeifraino e olto sources. approach pollution Our key to on country. agencies information regulatory the generate for method of replicable low-cost, well-being a and offers implications health has the which for manufacturing, regula- national brick the governing of tions violations widespread reveal imagery. data satellite Our in Bangladesh, in industry highly dis- informal a polluting of kilns, brick numbers identifying for large approach scalable machine-learning monitor accurate, an and demonstrates study locate This polluters. persed to governments the ability because the pollution, of lack amount responsi- large governments is a for for industry ble informal difficult where particularly countries, low-income a is in is It regulations challenge. environmental global with compliance Monitoring Significance eod rc in edtesle oa betdetection object an to themselves lend kilns brick Second, nwn rcsl hr l ftebikklsi Bangladesh in kilns brick the of all where precisely Knowing c,d i.S1 Fig. Appendix, SI tfn Ermon Stefano , https://doi.org/10.1073/pnas.2018863118 d eateto at ytmScience, System Earth of Department . y raieCmosAttribution-NonCommercial- Commons Creative a,c o xml mgso kilns). of images example for . y ai .Lobell B. David , https://www.pnas.org/lookup/suppl/ f etrfor Center c,d | , f10 of 1

COMPUTER SCIENCES SUSTAINABILITY SCIENCE constructed, this manual mapping approach is also a highly ineffi- dictions to incorporate them into an expanded and improved cient way to obtain the high-frequency information necessary for training dataset (Materials and Methods). We show how our monitoring. Moreover, government data in low-income countries approach is robust to the visual, geographic diversity of satel- is often inaccurate and subject to bias (21). lite imagery. We also demonstrate its scalability. Although our Here we develop a scalable, deep-learning pipeline for accu- approach required human input during model development, this rately locating brick kilns across a broad geography. Brick kilns is a one-time fixed cost; our model is virtually free to run repeat- are visually distinct in satellite imagery, but it is expensive and edly and generate predictions across huge geographies without time-consuming to annotate images with their exact locations. further input. We make improvements on two recent papers that use machine- We then demonstrate how this map of brick kiln locations learning methods to identify brick kilns (22, 23) by presenting can be used to study compliance with environmental regula- an approach that only requires classification-level annotations tions, show how populations are exposed to brick manufacturing (indicating whether an image contains a brick kiln or not) to nationally, and examine the contribution of kilns to air pollution train a single model that performs the more difficult task of in Dhaka. detection (indicating where within an image the brick kiln is located). Building on past applications of deep-learning to envi- Results ronmental monitoring (5, 6, 23), and on specific efforts to use Identifying Brick Kilns. The model we trained on the initial, lim- deep learning to identify brick kilns (22, 23), we develop a ited dataset attained a low precision (the proportion of true weakly supervised learning approach that learns to localize kilns positives among all images predicted positive) and very low recall given lower cost classification-level data, rebuilds kilns that have (the proportion of true positives among all positives) on the been fragmented across multiple images, an inherent problem final validation set (respectively 82% and 56%; SI Appendix, Fig with satellite imagery, and distinguishes between two kiln tech- S3A). After steps to improve the model’s ability to generalize nologies, based on their shape. To produce this final model, over the entire country (Materials and Methods), we launched we conducted two main model development steps to improve the final kiln classification model over the 1.8 million images the accuracy of our approach, first training an initial model that cover Bangladesh at a spatial resolution of 1 m per pixel. on a small, limited dataset, using it to classify images from Our model predicted 7,380 images contained kilns (positive) different parts of the country, and then hand-validating the pre- and 1,785,904 did not contain kilns (negative). To evaluate the

ABConfusion Matrix for Kiln Detection Confusion Matrix for Shape Classification

PREDICTION PREDICTION Kiln No Kiln Zigzag FCK

Kiln 6547 5 Zigzag 944 96

No Kiln 833 6995 FCK 208 375

Precision: 0.887 Zigzag Accuracy: 0.908 (LB: 0.809, UB: 0.918) Recall: 0.999 FCK Accuracy: 0.643 (LB: 0.528, UB: 0.707) Accuracy: 0.942 Overall Accuracy: 0.813 (LB: 0.754, UB: 0.826) Prob. of missing a kiln: 0.07% (95% CI: [0.0088%, 0.13%])

Fig. 1. Confusion matrices and performance summary for imagery from all of Bangladesh. (A) The confusion matrix for the performance of the final model from SI Appendix, Fig. S2 launched over the imagery from all of Bangladesh, after being hand-validated by our team. Note that all 7,380 images the model detected as containing kilns (positive images) and a division-stratified sample of 7,000 nonkiln (negative images) from around 1.7 million were hand-validated. The rows indicate the actual label (kiln, no kiln), while the columns indicate the prediction from the model. The green cells indicate correct predictions, where the top left are true positives (images with kilns that were correctly predicted by the model to contain kilns) and bottom right are true negatives (images with no kilns that were correctly predicted by the model to contain no kiln). The red cells are incorrect predictions, where the top right indicates false negatives (where the model predicted an image did not contain a kiln but the image did contain a kiln) and the bottom left indicates false positives (images that contained no kiln but were predicted by the model to contain a kiln). Performance statistics (precision, recall, and accuracy are presented beneath the confusion matrix. “Accuracy” is defined as the number of images correctly identified divided by the total number of images, “precision” is defined the number of true positives divided by the total number of images predicted as positive, and “recall” is defined as the number of true positives divided by all positive images. Because the validation of negative images was performed on a sample of images, we constructed an estimate and 95% CI for the probability of missing a kiln. (B) The comparable confusion matrix for the performance of the shape classification model on a hand-validated sample of 1,623 images (note that the original sample contained 1,750 images, but 127 were dropped due to the inability to distinguish the shape during hand validation). Lower and upper bounds on the predictions were calculated by assigning the 127 “unclassifiable” images to different cells. For example, to get the lower bound for “ZZK accuracy” we assume all of the 127 images contain ZZK but were incorrectly classified by our model as FCK.

2 of 10 | PNAS Lee et al. https://doi.org/10.1073/pnas.2018863118 Scalable deep learning to identify brick kilns and aid regulatory capacity Downloaded by guest on October 1, 2021 Downloaded by guest on October 1, 2021 clbede erigt dniybikklsadadrgltr capacity regulatory aid and kilns brick identify to learning deep Scalable al. et kiln. Lee each around (Left color brown Bangladesh the , as near appear that Dhaka, bricks of drying of north rows just by surrounded kilns of cluster a 2. Fig. 61.7 CI: (95% 64% is kilns propor- hand-validated ZZK The all contain FCK. among as ZZK images for of model 127 tion our bound by the classified lower of incorrectly the were all but of get assume cells to we different accuracy” example, “ZZK the For by matrix. to calculated confusion kilns were the “unclassifiable” accuracies 127 the the on 1B). assigning (Fig bounds FCK upper for accuracy and 64.3% Lower and overall, ZZK, accuracy for 81.3% accuracy sample, achieved 90.8% conflicts hand-validated model the any classification on shape with Based our person. image, third each a images by again 127 examining resolved which in people quality), distinguished image two be poor involved not to reduced due could We was hand-validation shape which during shape (ZZK). the images, as the kilns 1,750 1,623 of of to zigzag total sample initial 25% as (an selected predictions 5,011 randomly and a fixed- as hand-validated (FCK) shape-classification 1,967 classifying kilns The kiln, each chimney technologies. for predictions kiln between made distinguish model dominant the to to two pipeline passed were the the which of total, 1A), step in (Fig. shape-classification images centroids negative 6,978 false identified 5 we and images positive true 6,547 Methods and each (Materials of coordinates kiln super- geographic weakly precise a identify the to developed within approach we vised kiln school), the a from is distance (e.g., allowable compliance regulatory calculation for see important 2,393; to 158 out CI: around that (95% missed in implies kilns details likely 7,000) model with of our images out images 1,276 (5 negative we million 0.07% 1.79 of images negatives of negatives) false negative all of approx- (ratio of 7,000 an rate out the omission obtain false can from Our we hand-validated. extrapolating all images, hand-validate by negative not did million imation we 1.79 because the run statis- model accuracy of final complete demonstrated the report as on cannot country, we tics entire Although nega- 1A. the Fig. 7,000 from in imagery 99.9% sample final the hand-validated and on Our the precision, tives) reviewer. from 88.7% third (calculated accuracy, a recall by 94.2% people resolved achieved two were model by conflicts hand-validated any was and image hand- the each across results consistency ensure validated To images. sample negative random 7,000 positive of division-stratified, 7,380 administrative all an hand-validated and we images model, final this of accuracy eas dniyn h rcs oaino rc in is kilns brick of location precise the identifying Because a fmdlgnrtdkl oriae.Ti a lt h oriae dnie yormdl(hw ngen vrstlieiaeyfor imagery satellite over green) in (shown model our by identified coordinates the plots map This coordinates. kiln model-generated of Map aeil n Methods). and Materials and i.S3 Fig. Appendix, SI .Fo the From ). n hlaDsrc ( District Khulna and ) h oenetdt n ,4 nace in normodel our in kilns unmatched 1,048 in kilns and model-predicted unmatched data 1,341 the government leaving and kilns, the 5,390 kilns 2 matched We GoB-mapped verifica- within kilns. matches and the nearest-neighbor mapping between the manual km identifying by through (20), our GoB coordinates tion the the by to number predictions true by detected model the produced the kilns on compared bound of We lower kilns. number a sam- of considered the random be a should images, govern- model hand-validating negative the on of recall that based the ple implies 1,276 because approximation Moreover, which around an kilns. miss 2,393), missing is also would to are we data 158 ment’s implies CI: model kilns (95% our traditional (n kilns of of Environment number recall of total Department The the the than by fewer mapped figure This 300 Bangladesh. around in kilns is val- 6,978 hand located model through Manufacturing. our identified idation, negatives Brick false adding of and positives Scope and Scale be could they 18 when government was the median quite by matched. the mapped fall while kilns coordinates the m), predicted to 236 the close (SD generally kiln m that identified 69 suggesting model was m, a kiln GoB between a distance and 3), average (Fig. the government predictions the found by model we mapped the kiln in comparing nearest-neighbor usually the When area, to an kilns. to in of failed kiln clusters single model dense every our for when coordinates cases kiln the also the the obtain on were demonstrates landing there which often gen- However, kilns, locations, itself. the coordinates two geolocate the to in ability shows model pipeline’s our 2 by Fig. erated coordinates. geographic to CI: (95% 66.7% becomes sample the 68.9%). ZZK, the in 64.5%, be In ZZK to of assumed 61.7%). are proportion ZZK 57.1%, kilns the CI: of 127 the (95% proportion where 59.4% the scenario “unclas- alternative becomes FCK, 127 sample be the the to when in assumed scenario 127 are first all assuming kilns the 2) sifiable” Under and extreme FCK ZZK. two are are under kilns reesti- kilns CIs 127 all we 95% assuming and this, 1) ZZK cases: address of quality, proportion To image the poor uncertainty. mated to additional due classify 127 adds not the which could for team account our not that does kilns however, estimate, This 66.5%). to ntefia tpo h ieiew ovre h centroids the converted we pipeline the of step final the In Right .Teklstesle r h e bet,wihare which objects, red the are themselves kilns The ). https://doi.org/10.1073/pnas.2018863118 fe eoigfalse removing After ,7)(20). 7,271) = PNAS | f10 of 3

COMPUTER SCIENCES SUSTAINABILITY SCIENCE 1,341 5,930 1,048 gov-mapped model-idenfied kilns matched kilns remaining kilns remaining

Government-Mapped Kilns Model-Idenfied Kilns (7,271 total) (6,978 total) Kilns Idenfied by Both

Fig. 3. Comparison of kilns identified by the Bangladesh government (7,271 total) and kilns identified by our model (6,978 total). First, we matched model- identified kilns (6,978 total) to government-mapped kilns (7,271) by identifying the nearest neighbor within 2 km. A total of 5,930 kilns were matched between both sets, leaving 1,341 unmatched kilns among the government-mapped kilns and 1,048 unmatched kilns among the model-identified kilns.

predictions (Fig 3). The number of unmatched kilns in the gov- structed without government permits. The proportion of FCK ernment data are well within the interval we would expect, given among the detected illegal kilns (that is, the model-identified our model’s recall. However, since we hand-validated all of the kilns that we could not match to a government-mapped kiln) is positive images detected by our model, the 1,048 unmatched 28.7%. This is quite similar to the proportion of FCK in the over- kilns in the model predictions are assumed to be illegal kilns con- all predictions (28.2%) but lower than the proportion of FCK

ABKilns per District Discrepancies with Government Data (by District)

26°N 26°N

25°N 25°N

24°N 24°N

23°N 23°N

22°N 22°N

21°N 21°N

88°E89°E90°E91°E92°E 88°E89°E90°E91°E92°E

Kilns Banned Kilns per District 100 200 300 400 Discrepancies with GoB Data −100 −50 0 50 100

Fig. 4. Distribution of kilns across Bangladesh and comparison to government data. (A) Plot of the number of kilns identified by our model per district. (B) Plot of the difference between kilns identified by our model and those registered with the government. Red districts indicate districts where our model identified more kilns than the government records contain, and blue districts indicate the opposite. Three districts where kilns are banned are indicated by the dashed line border (Bandarban, Khagrachhari, and Rangamati).

4 of 10 | PNAS Lee et al. https://doi.org/10.1073/pnas.2018863118 Scalable deep learning to identify brick kilns and aid regulatory capacity Downloaded by guest on October 1, 2021 Downloaded by guest on October 1, 2021 clbede erigt dniybikklsadadrgltr capacity both regulatory In aid entity. and regulated kilns each brick to identify to distance learning certain deep a distance Scalable the within mark are al. lines that vertical et dashed Lee kilns The of km. percent 10 within The percent (B) entity. the each kiln. shows from brick color kilns lightest a brick the of while regulates km km, government 1 10 the within and which percent at 5, the shows 1, color within darkest is the panels, that length rail the of 5. Fig. mandates which Act, Manufacturing (Control) Brick Establishment the Kilns with Brick compliance and in are kilns whether that 5A). kilns. pregnancies (Fig. brick of kiln to million, number a proximity the 150 of in for km occur over distributions 10 while similar within of kiln, see lives population population, We a entire the of the of km 95% of we or 1 12% 2017, within or from live million, pol- occur Bangladesh, data 18 air that over population the that pregnancies gridded to found the vulnerable Using particularly as emitted. be well lution may population as which the kilns, kilns, of to near proportion exposed the is identified that we country, entire transparent external, an kilns. of of systematic accounting value This law. the overreport- CI the highlights with an 95% comply underreporting implies that the technologies further of kiln which of bound ing predictions, in upper model reported the our ZZK exceeds of of percentage data banned the government older, fact, the In (an 7%. FCK ZZK by to of technology) relative percentage technology) the cleaner overreports we (newer, government type, kiln the the of that in registry found government we constructed to which for then shape kilns the but 1,623 hand-validated the legal adja- compared we the are When districts. in they banned registered where formally districts are government cent the kilns in some than indi- model suggesting which our 4B, data, our by Fig. predicted in that whereas kilns blue districts fewer are respectively, cates the districts Moreover, banned kilns, kilns. three 19 the registered border and 27, 1 30, data and detected government model 9, the Fig. 1, of Rangamati), report portion and southeast Khagrachhari, the darban, in discrep- districts more red and 4B far three 4B), found we Fig. (the banned, in Chittagong, ancies specifically of map districts are the hilly kilns of three where in center specifically the looked in we when district (DoE) red Environment of dark Division Department Dhaka (the the of with district registered Gazipur are the in than kilns more 77 4 found (Fig. differs we kilns of distribution the kiln the data, ment on of based sample is 28.7% full this the so hand-validate subsample, predictions. a nonvalidated not only did but results important we type is that it note However, (35.9%). to sample hand-validated the in Protected Areas Total Population AB Health Facilities h pta rcso forrslsas losu ocheck to us allows also results our of precision spatial The the across mapped kilns brick of locations precise the With govern- the to similar is kilns of number overall the Although ulndi h ahdlns.I hs he itit (Ban- districts three these In lines). dashed the in outlined Pregnancies h ecn fagvnrgltdett rpplto hthsabikkl ihnacrandsac.Frrailways, For distance. certain a within kiln brick a has that population or entity regulated given a of percent The (A) Railways Schools Forests Regulated EntityProximity toKilns 55 5100 75 50 25 0 % withkilningiven distance m5k 10km 5km 1 km .Frexample, For A). olto rm5 in oadteU mas;tersl on in result increase blows the an wind Embassy; is US the day the period, a toward time kilns such this 54 in from day pollution unfavorable most the h feto in ndaily on kilns of effect the increase average daily the of 16% to equivalent in an is daily from effect the increased pollution during This season increase) carries one-SD production wind a to brick the (equivalent where kilns 12 days additional while 4), umn esn(hnte r prtoa)on operational) brick-production are the they during (when kilns season additional from the downwind represents being lines of in effect the change between difference for The CIs sons. 95% and model 15 daily within raises kiln season one production from µg/m brick smoke find the blows We during wind Methods). km the and where (Materials embassy days effects meteorological the that fixed for when year down- controlling days and was kilns, to factors embassy fewer compared from the as rest downwind when kilns was days (the more on season from larger nonproducing wind were the year) the and of changes Dhaka March) whether (approximately to in prevailing measured season November brick-producing the Embassy then the identify We between US pollution to day. in the dataset each reanalysis at direction wind wind monitor global quality a PM and air available com- publicly the we with outcomes, locations from environmental kiln on brick light the shed bined to used be also national with compliance limited Taken respectively. very regulations. 0.5%, represents and this 6%, are 7%, together ille- are kilns The forests, areas railways, are of 5B). protected to close 9% (Fig. kilns and too facilities and constructed of illegally health school kilns 77% of to a percent close that of too found km constructed 1 We illegally within law. constructed the kilns gally of of percentage the violation identified also in we constructed entity, be each can to kilns close vio- multiple and too that Because length, kiln 5A). one (Fig. railway least law at of this to lates 22% proximate facil- are forests, health areas of protected of of 8% 13% 26% that schools, to found of model and 11% our these edu- ities, of by from each clinics, identified on km kilns and data 2 the spatial hospitals matched and from We railways forests. and km public areas, 1 protected within facilities, built cational can kiln no Protected Areas Health Facilities oudrtn hte u a fbikkl oain could locations kiln brick of map our whether understand To PM 3 2.5 9%C:01 o07;see 0.76; to 0.18 CI: (95% Railways Schools Forests .On S1 ). Table Appendix, (SI season brick the during Kiln Proximity toRegulatedEntities 55 5100 75 50 25 0 % ofkilnswithinagiven distance PM PM m5k 10km 5km 1 km https://doi.org/10.1073/pnas.2018863118 2.5 2.5 col- , S1 Table Appendix, SI PM mle yteregression the by implied A y24 by PM lt h vrg percentage average the plots 2.5 PM 2.5 costetosea- two the across , µg/m fet fklsin kilns of Effects . 2.5 3 y5.4 by i.6plots 6 Fig. . PNAS PM 2.5 2.5 | µg/m f10 of 5 data 0.47 3 .

COMPUTER SCIENCES SUSTAINABILITY SCIENCE 150 since they provide a predicted probability of whether a kiln is an FCK (banned since 2013) and could be used to target likely violators. Additionally, there is a trade-off between spatial resolution 100 and processing time. Satellite images taken at higher zoom levels cover less spatial area for a fixed image size than lower zoom lev- els (because they are more “zoomed in”), which means images are clearer but more images are necessary to cover an area, 50 increasing processing time. One potential solution could employ

PM2.5 (ug/m^3) methods that screen low-resolution images first to identify likely kiln areas then use high-resolution imagery only for those areas to make predictions, saving computational cost (24). In our results, the final coordinates we identified often landed 0 directly on the kiln, but there were instances when the model 02040 was not able to mark every single kiln, particularly in dense clus- # of kilns ters. Although we were able to address the problem of double counting by rebuilding kilns split across multiple tiles, the prob- Kilns Off Kilns On lem of splitting clusters remains another area for future work. A limitation of our image classification approach is that it cannot Fig. 6. Effect of brick kilns on daily PM2.5 during the brick-production sea- distinguish between operational and nonoperational kilns if the son. Predicted effects of daily upwind kilns within 15 km, holding all other structure of the kiln is still present. An interesting direction for variables at means, shown from the difference-in-difference regression, future research is to incorporate satellite-based information on plotted in blue for the brick-production season and red for the nonproduc- ing season. The shaded areas are 95% CIs. The difference between the lines heat or emissions into the neural network to identify operational represents the additional effect of upwind kilns during the brick production kilns. season (when they are operational). We selected a few applications to demonstrate the benefits of this approach but also ones that shed light on the regulatory land- scape for brick kilns in Bangladesh and provide information on other distances (12 and 20 km) were similar and are shown in who is affected by the harmful by-products of brick production. SI Appendix. These were meant as examples of what is possible with this type of data and demonstrated the applicability to other industrial Discussion operations or public health hazards. We have demonstrated a successful and highly accurate deep- Our results suggest that a substantial proportion of the learning approach for identifying brick kilns in satellite imagery, Bangladesh population lives in close proximity to brick kilns as well as a series of applications using the model predictions and almost the entire population lives within 10 km of a kiln. that highlight the utility of obtaining spatially precise, external This is because brick kilns tend to be clustered just outside data on an informal industry. Compared to manual approaches large urban areas where the majority of construction occurs that require either sending individuals out to collect GPS infor- in response to rapid urbanization and development. We also mation or manually scanning satellite imagery, our approach is find that adherence to the Brick Manufacturing and Brick Kilns quick, inexpensive, reproducible, and easily scalable. Although Establishment (Control) Act is low. Thirteen percent of health the model development required human involvement, now that facilities and 11% of schools are within 1 km of a brick kiln, in it has been trained and is highly accurate, it is virtually free to run violation of the Brick Manufacturing and Brick Kiln Establish- again, provided there is access to computing power and imagery. ment (Control) Act, while 75% of kilns are illegally constructed Our model can now be applied “off-the-shelf” by government too close to schools (Fig. 5B). The brick sector is no anomaly, agencies or civil society to locate kilns over large geographies as corruption and limited capacity hinder enforcement and without further human input. Our approach also has the added compliance with regulations across traffic and transportation, benefit of objectivity and transparency for independent civil soci- forests and conservation, food safety, building codes, and income ety organizations interested in environmental monitoring as it taxes (25–30). does not rely on government records, which we demonstrated By examining the contribution of brick kilns to PM2.5 con- contain inaccuracies, such as missing kilns (Fig. 3), and are some- centrations, we shed light on the consequences for the exposed times biased with respect to regulations (Fig. 4B). Additionally, populations, school, and health facilities. On days where the wind even when taking the upper bound of the shape classification blew pollution from 54 kilns toward the embassy, PM2.5 concen- predictions and assuming all 127 “hard” images were ZZK, the trations were 24 µg/m3 higher relative to days where the wind proportion of ZZK relative to (illegal) FCK in the government is did not carry any kiln pollution. This is almost equivalent to the overstated. World Health Organization guideline for 24-h PM2.5 exposure From a machine-learning perspective, we demonstrated the (25 µg/m3) coming only from kilns on a bad wind day. viability of a model that learns to perform the task of geolo- With the evidence presented on illegal proximity of kilns to cating objects from only classification-level labels, a successful schools, health facilities, and populations (including pregnan- weakly supervised learning approach. However, there were lim- cies), our findings suggest that large proportions of the popu- itations. Image quality was not always consistent, as clouds lation in Bangladesh are exposed to the air pollution generated obscured certain areas or the satellite encountered issues in cap- by kilns. This exposure is particularly troubling when considering turing imagery, causing parts of or entire images to be blurry or pregnant women and schools that are close to kilns. Pregnant missing. Although our shape classification algorithm was fairly women and school children are highly vulnerable to pollution accurate, it performed worse than the overall prediction model, exposure (31), and both in utero and early-life exposure to air which was likely influenced by image quality. Future work could pollution has been linked to poor pregnancy outcomes, poor improve the accuracy of this step in the pipeline and addressing child health status, lower academic performance and cognitive imagery quality would be an important step here. Nonethe- development, and other adverse adult outcomes (32, 33). Given less, even in its current state the shape-classification algorithm that our model missed around 1,276 kilns, the extent of expo- could provide results that are extremely useful for regulators sure and adverse impacts are likely underestimates. Ultimately,

6 of 10 | PNAS Lee et al. https://doi.org/10.1073/pnas.2018863118 Scalable deep learning to identify brick kilns and aid regulatory capacity Downloaded by guest on October 1, 2021 Downloaded by guest on October 1, 2021 clbede erigt dniybikklsadadrgltr capacity regulatory aid and kilns brick identify to learning deep Scalable al. et Lee n-ue o pcfi ak .. eetn h rsneo in.W pi the then split use is We make which kilns. of dataset, to presence larger the model detecting much a i.e., a task, allows specific on that for training fine-tuned technique from used learned features widely of a Trans- with dataset. is (38) network a learning ImageNet took the the fer specific on we initialized training by our CNN, we produced were the for means that for weights which important architecture per- approach, was VGG-16 learning architecture which a transfer VGG-16 with precision, forward the of Moving that terms task. found in We best and training size. formed of and speed architecture e.g., accuracy, Each weaknesses, testing, (37). and ResNet50 strengths and different either (36), with as comes XCeption (35), images Inception VGG- (34), (34), classify VGG-16 19 Appendix, including to architectures, (SI different with was dataset experimented and pipeline initial our the of as data task negative. this first or positive appeared to The kiln a refer S2). of We 25% Fig. satellite least image. of at nature if the the positive kiln). to as in a due image contain images an not multiple defined do we across imagery, that split be (images can images kilns “negative” con- Since 364 that coordinates and (images kiln) these images a “positive” contained 52 tain that in resulting images duplicates, satellite removed without and the vicinity from ini- nearby in identified kilns the the We Districts in of kilns. form coordinates hand-picked coordinates To and we GPS metadata. and , Bangladesh the collected Manikganj, of workers Jessore, corner part field Brahmanbaria, left spatial as data, top this included training the at were a tial of images image at coordinates million each longitude 1.8 image) reso- of and by an 1-m-per-pixel latitude covered a of The is channels approximately resolution. country values three to entire color translates with The which pixels blue lution. 17, 224 and of 5% by of level green, than 224 period zoom more red, was were the the tile that from Each images (representing Bangladesh out clouds. filtering of by 2019, region obscured May entire to 2018 the October cover to limitations. italGlobe and data Initial Pipeline. Machine-Learning Methods and Materials sectors. and industries of of implications range environmental a across and noncompliance health compliance, human in regulatory reducing for improving thus advances promise and great Ultimately, hold transparency law. a observation promoting Earth the introduce gov- and pressure enforce learning and as machine to violators well as officials disclose activities, ernment illegal publicly hiding to for disincentive used information this be organizations, society Alterna- civil could potential. of and great hands monitor the have in to violations tively, govern- detect capacity the to enhance limited ability can have of ment’s that we tools context regula- low-cost the as regulations, existing In enforce yet with low. environment, compliance is analysis the tions this and in health demonstrated human to health harm or of costs schools, the government forests, reduce sending could to of violations oversight. and address close presence verifying to as too representatives manually such built before violations, kilns algorithm likely facilities, an identify or Running to predic- FCK data). model ours our own as since to government’s such Bangladesh according the (both in and exist banned tions still been many yet have to 2013, dis- respect FCK banned with three Similarly, and built railways, tricts. be forests, facilities, of can health violations kilns schools, extensive where fairly governing con- documented human regulations organizations potential of We other society violations. and locations civil labor rights bonded precise for and child useful the cor- about be cerned by knowing would plagued example, kilns system brick For violations a (16). on to data ruption transparency transparent the external and Moreover, external introduces violations. allocate generate regarding best to priorities to ability set how or determine resources apparatus health regulatory a from strained insufficient are areas critical from km perspective. built 1 be kilns least mandating at regulations the demonstrate results our eue eplann otanacnouinlnua ewr (CNN) network neural convolutional a train to learning deep used We substantial causes manufacturing brick of mode current The con- a help could law existing of violations identifying Easily edwlae aelt mgr rmDig- from imagery satellite downloaded We 2% e n rie ormdl,oefrec aae.W etdthe tested We dataset. each for one models, four trained and set (20%) 4) 3) 2) Tangail: and 1) Sylhet in mistakes model’s the of combinations model’s different the thus and data training the of were (“mis- diversity knowledge. These examples. incorrectly the difficult expand classified or to hard model included considered the be approach, could that (41) thus images and mining takes”) identified example hard we a which used precision in We 47% overfitting. on only was achieved obtaining model it density, our 98% that kiln Bangladesh the brick northeast to and in compared terrain region in hilly different a is is which dropped Sylhet, performance in model that substantially observed We regions. contiguous Tangail l daetiae eecnandi h ape loigu orebuild to us allowing sample, the since images. splitting in multiple kiln contained across of kilns were issue the images address adjacent to regions The all helped District). contiguous regions Tangail experi- and contiguous two Division robustness of (Sylhet use covering country the the images of For parts multiple together. different hand-validated across from split clustered first are are we that kilns that ments, detect kilns to parts factors: ability distinct and two its geographically images 2) of in and well terms country perform in the and model of generalize our to ability of its robustness 1) the verified we country, experiments. noise high-quality. Robustness reduce were to data predictions the hand-verified ensure we and means “humans which data with (40), distillation as loop” data the can [known of in further process round student one dataset This conducted the model. the We expand teacher (39)]. considered distillation to the repeated is of and results which automated the dataset, be from learned expanded it this as trained then on model We team. model research new our by a corrected or verified were that labels (AugData (AugData split data We the ). S2 or of 1 Fig. Dataset Augmented Appendix, as train- dataset (SI this additional to these refer and incorporated hand- examples an We ing for and images. negatives true images negative and 1,614 positives 75,000 additional false the of and from images images a sample positive sampled additional as randomly the 301 uncovered dataset We on predictions. initial the predictions images validated the teacher 75,000 make the of from used to We total model. trained “student’s” model a a model educate for can the divisions) that model two considered “teacher” into the We and split label. size later larger to its was for it account that to 15,000 fact (except sampled division we which administrative from each Dhaka, from images 10,000 satellite sampled adjacent randomly across expansion. con- split geographically kilns Data label of to examples second obtain images. a to and regions acquire to divisions tiguous once two other predictions, conducted model’s from we the the verify data, examples for to the validation technique expand hand To expansion and of dataset. rounds data kilns the a of improving appearance utilized of our visual purpose we expand the mil- Thus, to in on needed surroundings. variation predictions we their greater country, accurate the capture make in across to to from concentrated dataset The order images were positives. diverse In of country. they true lions the were but of them examples, region of useful one 621 around provided only predicted of dataset model but the 10% initial positive, images, sampled as 176,000 randomly images of in we total 10,726 limited a When was country, diversity. dataset entire geographic the initial the and However, size set. both validation the to on us accuracy allowed which of and model. results the set the of training performance set, the machine-learning the validation assess in standard the examples on in the images predictions As from of made learned set sets. overall model validation the the and from approaches, images training negative the to maintain- (20%), positive within set of validation ratio a and the (80%) ing set training a into dataset initial esltec fteaoedtst noatann 8% n validation and (80%) training a into datasets above the of each split We incorporating tested we dataset, training effective most the construct To erntemdltandon trained model the ran We 97% and data training the on accuracy 98% reached quickly model The ain rmbt yht(S Sylhet both from cations AugData (T Tangail from tions AugData (S Sylhet from tions sample 75,000 and AugData data initial the from examples AugData val 1 .Teotu rdcin ftetahrmdlpoie initial provided model teacher the of predictions output The ). 1 1 1 1 nyeape rmAgetdDtst1 hc includes which 1, Dataset Augmented from examples only : + + + S T S xmlsfo umne aae n misclassifica- and 1 Dataset Augmented from examples : xmlsfo umne aae n misclassifica- and 1 Dataset Augmented from examples : + ocet oentoal ersnaiedtst we dataset, representative nationally more a create To T xmlsfo umne aae n misclassifi- and 1 Dataset Augmented from examples ) train 1 ) n aiainstwt h eann 20% remaining the with set validation a and ) eoelucigtemdlars h entire the across model the launching Before n agi (T Tangail and ) AugData AugData https://doi.org/10.1073/pnas.2018863118 AugData 1 1 vriae rmteSle and Sylhet the from images over noatann e ih80% with set training a into ) val 1 hsrie ocrsthat concerns raised This . PNAS AugData | f10 of 7 1

COMPUTER SCIENCES SUSTAINABILITY SCIENCE models on all four validation sets, as well as the entire contiguous regions centered around the centroid of size 224 × 224. We input each of these of Sylhet and Tangail to check their performance at scale. The model trained recentered and cropped images to the shape classification CNN to pro- on AugData1 + S performed best. We moved forward with this final train- duce the final predictions. Finally, we translated the pixel location (x,y) of ing dataset, which we call AugData2 (SI Appendix, Fig. S2). We conducted the centroid within the image to its geographic coordinates, modeling the more iterations of training to tune hyperparameters (parameters that are distance in x as a difference in longitude and the distance in y as a dif- set before training and affect aspects such as how the model learns, how ference in latitude. The entire pipeline is portrayed visually in SI Appendix, quickly, and for how long) to optimize performance gains, resulting in our Fig. S3). final model. To compare the different models we developed throughout these steps to improve the robustness and generalizability, we evaluated Evaluating final model performance. In a standard prediction problem, to each one on the same validation set (SI Appendix, Figs. S4 and S5). These evaluate the performance of an algorithm one would compare the model results demonstrate the gains in performance achieved by the training data predictions to the true values in a test dataset. During model development, expansion and robustness experiments and the high level of accuracy our we used data with underlying labels (kiln or no kiln) split into training, vali- final model obtained. dation, and test sets to evaluate the model’s performance (SI Appendix, Figs. Kiln localization. We developed a weakly supervised approach in which S4 and S5). However, our final model was run on imagery covering the coun- the model learns from the image-level classification labels of positives try, which does not come with labels. To assess the model’s performance, we and negatives to perform the harder task of localization (i.e., identify- hand-validated all images predicted as “kiln” (7,380 images) and a division- ing where within an image the kiln is located). This approach is weakly stratified, random sample of 7,000 negative predictions. We generated a supervised precisely because it does not require the training data to be confusion matrix with the results of this hand validation (Fig. 1). However, annotated with exact kiln locations, which can be very time-intensive to because we did not hand-validate all 1.79 million negative images, the accu- produce. First, we modified the architecture of the CNN. We added a global racy statistics are an estimate. To account for this uncertainty, we used the average pooling layer as in ref. 42, which takes the feature maps the false-negative rate we found in the hand-validated sample to approximate model has learned in past layers and weighs them in terms of how much the number of false negatives across the full set of negative predictions and they contribute to the final prediction. By summing the contributions, it construct a CI around the estimation. is possible to produce a class activation map (CAM), which visualizes the To do this, we used the maximum likelihood estimator to estimate the model’s signals, like a heat map of which pixels contribute most strongly false-negative rate as the mean and variance of a Bernoulli variable and to the model’s prediction (SI Appendix, Fig. S3). These clusters of acti- then constructed a 95% CI using a critical value of 1.96. Specifically, let p be vated kilns provided an approximation of where the kilns were located in a the probability of any negative image (“no kiln”) of being a false negative positive image. (image that actually contains a kiln). If in a sample of negative images of Kilns can be split across multiple satellite image tiles, resulting in the same size N, there are n false negatives, then the maximum likelihood estimate kiln’s being detected in multiple images and thus counted more than once. (MLE) of p is Therefore, we developed an approach to rebuilding split kilns by generat- n pˆ = . [1] ing “connected components,” in which spatially adjacent tiles are stitched N together to form a contiguous area. Starting from a given positive image, Also, for a large sample size, pˆ converges to p in distribution (asymptotic our approach used breadth first search to branch out to find all its neigh- distribution of the MLE), i.e., boring positive images. By stitching the images together, we also connected p activated areas from the CAMs. NI(p)(pˆ − p) ∼ N (0, 1), [2] However, the CAM output was very noisy because features in an image other than the kilns contributed to the model’s prediction. To focus the where I(p) is the Fisher information. For a Bernoulli variable, we have model signals on just the kilns, we conducted postprocessing on the con- nected components. Each pixel had a value between 0 and 255, where a 1 I(p) = , [3] higher value meant a stronger “kiln” signal. The first step filtered out weak p(1 − p) signals. We subtracted the mean pixel value from all pixels, which zeroed out the areas with low values. Then, we calculated the mean pixel value which is also the inverse of the variance. With this, we construct a 95% CI from only nonzero pixels and subtracted that value as well, leaving only the around pˆ using a critical value of 1.96 (zα/2, with α = 0.05): strongest signals in the image. We then normalized the values, such that the s lowest value was 0 and highest was 255. Next, we concentrated the signals ˆ ˆ p(1 − p) . and filtered out any remaining noise. We increased contrast to amplify the pˆ ± 1.96 × [4] N signals and passed the image through a median filter and Gaussian filter to smooth areas and remove any small patches of signals that were too small Similarly, to construct a CI for the performance of the shape detection to be a kiln. We iterated this second step of amplifying and smoothing sig- algorithm, which distinguished between FCK and ZZK, we used the same nals. Finally, since the smoothing could unintentionally expand the size of approach. If we let n be the number of ZZK in our hand-validated sample of kilns, we used a morphological operator that erodes pixels around the edges N brick kilns, then the estimated ratio of ZZK to all kilns, pˆ, can be derived of the remaining clusters of signals. The centroids of each of these clusters using the same formula (Eq. 1), and the 95% CI can be obtained using Eq. 4. provided the locations of the kilns. In cases with multiple kilns in a single image, the activated pixels could Scale and Scope of Brick Manufacturing. be conjoined as one large mass in the CAM output. We kept track of a Data. We compiled numerous sources of data to assess the scope of brick moving average of the size of these activated areas to estimate the aver- manufacturing in Bangladesh and the extent of regulatory compliance. First, age size of a kiln. When an activated area was larger than the average, we used two sources of data from the DoE of the GoB on the official number we marked more centroids. For example, when an area was twice as large of kilns. One is the aggregate number of formally registered kilns in each as the average kiln size, we collected two centroids spread evenly across district from 2017 to 2019, split out by the type of kiln and whether the kiln the pixel mass. owner secured environmental clearance (18). The other is GPS coordinates of kilns that were manually mapped by the Clean Air and Sustainable Envi- Shape classification and generating coordinates. Equipped with the centroids ronment (CASE) project (20). Second, we used globally gridded datasets on of the kiln clusters, we implemented the final step of the pipeline: clas- total population and pregnancies produced by WorldPop (43) at a 100-m sifying the shape of each kiln. First, we prepared a dataset by labeling the spatial resolution to identify exposed populations, particularly populations hand-validated positive images detected across the country with kiln shapes. of interest to policymakers. There were two classes: FCK, which are ovular, and ZZK, which are rectan- Third, we utilized publicly available data on the Humanitarian Data gular. Of the 3,345 images we labeled, 1,193 were FCK and 2,152 were ZZK, Exchange (44) to assess compliance with the GoB regulations (details on which were split into a training and validation set with the same ratio of the regulations are given below). Specifically, we used shapefiles containing FCK to ZZK. Using deep learning and transfer learning again, we trained nationally designated forests and national parks, railways, health facilities, a second CNN to classify the kiln shape, which achieved 98% accuracy on and schools, and administrative boundaries, as well as shapefiles from the the training set and 96% accuracy on the validation set. With an accu- World Database on Protected Areas (45). rate shape classification model trained, we then launched our end-to-end Finally, to assess the contribution of brick kilns to air quality, we utilized pipeline on unseen imagery covering the entire country. For each centroid publicly available PM2.5 data spanning 2017 to 2019 from the air-quality identified from the localization step of the pipeline, we cropped an image monitor at the US Embassy in Dhaka. The data contained observations every

8 of 10 | PNAS Lee et al. https://doi.org/10.1073/pnas.2018863118 Scalable deep learning to identify brick kilns and aid regulatory capacity Downloaded by guest on October 1, 2021 Downloaded by guest on October 1, 2021 clbede erigt dniybikklsadadrgltr capacity regulatory aid and kilns brick identify to learning deep Scalable al. et Lee ubro pidklsi ifrn itne—l ayn daily: varying distances—all different in daily-level kilns a upwind are of data number resulting The PM classified. km). with be dataset 50 would to kilns how (12 and embassy tions the from S6 45 distances Fig. the increasing Appendix, in both kilns (in we upwind embassy embassy, of the and of kiln upwind each 30 between was identi- bearing it kiln the each whether Given and determined model. embassy the the by between fied bearing and distance pairwise prt nyi h r,wne otsbtenNvme n March), and (kilns November production between brick months of winter timing dry, the the in in embassy variation only the direction from seasonal operate upwind wind and being in kilns day changes of each daily numbers leveraged differing in we resulted kilns, that brick PM more generated whether from pollution health test air wind and To to schools, kilns. exposure centers, population’s the city the by as reduce such to importance, is of facilities, areas in kilns geographic brick designated also their we of districts within violation hilly fell in three kilns are the brick kilns boundaries. and any of areas, whether percentage protected entity what checked forests, regulated also For each and law. of the kilns percentage the what to reg- in say exposed GoB to identified is the us kiln violate allowed distances every This minimum line, to ulations. the rail facility whether forest, checked reg- health each We and these and country. between of Khagrachari, school, distance each (Rangamati, area, the on brick districts protected calculated data no we hilly available 2008 entities, three publicly of ulated the in Policy Using up km Kiln (19). set Brick 2 Bandarbans) clinics, be the and and can hospitals per wetlands Additionally, of kilns and forests. km railways, public 1 areas, of within protected built facilities, being from educational exposed kilns the restricts identify 2013 to kilns to grid distances the varying summed total We within populations. pregnancies. on the were of WorldPop Then, matched that number from other. we total cells data each and nationally, population people of kilns of gridded km brick number the 2 to to within exposed coordinates were is kiln that who algo- kilns understand matching to to nearest-neighbor a matches coordinates used kiln that we the predictions rithm GoB, to the shape predictions by the model envi- mapped our to with manually compare data To kilns GoB our model. of our and in percentage from data reported GoB the (ZZK) the compared clearance between ronmental also difference identified We num- the the kilns and predictions. calculate district the model We per level. aggregating kilns district by of the kiln, ber to model of machine-learning type our and by district by numbers Analysis. 45 using embassy the toward ing com- wind Eq. v and (see and direction degrees u and the speed converted wind we to this, do ponents To direction embassy. affected the wind and at downwind in readings traveled the changes kilns from daily pollution utilizes whether average. determine below) daily circle to a (see to buffer data approach 15-km all analytic aggregated a and Our Dhaka for in 0.1 Embassy data US of meteorological the Source around resolution all Date spatial extracted Climate a We the from longitude. at components) (rep- data vector direction reanalysis v data and and ERA5-Land gridded speed u used wind the also by and We resented average. temperature, daily precipitation, a hourly to on aggregated were which h, 3 .E eai .E o .MDnuh Invtosfrevrnetlcompliance: environmental for “Innovations feeding McDonough, animal A. concentrated Ho, map E. D. to Benami, learning E. Deep 6. Ho, E. D. Handan-Nader, C. 5. enforcement. and monitoring environmental of economics The Shimshack, P. J. 4. regulation. environmental Enforcing Gunningham, N. compliance. and 3. Enforcement regulation: environmental Implementing Heyes, A. compliance. 2. and Enforcement stick: things Making Heyes, A. 1. ◦ ial,w sue h rmr ainl frgltn h rxmt of proximity the regulating of rationale primary the assumed we Finally, of Act (Control) Establishment Kiln Brick and Manufacturing Brick The mrigeiec n potnte”(IP oiybif tnodIsiuefor Institute Stanford brief, 2020). policy CA, Stanford, (SIEPR Research, Policy opportunities” Economic and evidence Emerging operations. Econ. Res. Rev. (2011). Econ. Regul. (1998). 50–63 etr) eageae hs ydsac odtrietenumber the determine to distance by these aggregated We sectors). efis odce netra hc nteGBrpre kiln reported GoB the on check external an conducted first We s a.Sustain. Nat. 0–2 (2000). 107–129 17, swn pe)adietfidtediywn ieto blow- direction wind daily the identified and speed) wind is 3–6 (2014). 339–360 6, 2.5 rsnsadarmta hw w ifrn iddirec- wind different two shows that diagram a presents an eprtr,wn pe,wn ieto,adthe and direction, wind speed, wind temperature, rain, , θ 9–0 (2019). 298–306 2, s = = 180 p u 2 + ◦ + 2. 180 5 n 30 and v π shge rudaesta r down- are that areas around higher is 2 arctan2(v ◦ etr.Te,w acltdthe calculated we Then, sectors. where 5, , u) .Evrn Law Environ. J. . θ x.Rv cn Pol. Econ. Rev. Oxf. swn ieto in direction wind is ◦ latitude 169–201 23, × ◦ Annu. 0.1 and [5] 14, SI ◦ J. 2 .K utkna .A eu,Z au,Priuaepluinfo rc inclus- kiln brick from pollution in Particulate Wadud, matter Z. particulate Begum, fine A. by B. pollution Guttikunda, K. Air S. Markwitz, A. 12. Hopke, K. P. Begum, A. semi-urban B. and urban 11. at aerosol atmospheric of sources of Investigation Begum, A. B. 10. pet h tnodWosIsiuefrteEvrnet(V rn) and 1522054. Grant), and Devel- (EVP 1651565 Environment Global We grants the Ser- for work. for NSF Center Institute and this Woods King of Stanford Erden, Stanford the iterations the Bora opment, earlier from Tan, with funding for Chris the assistance Tang acknowledge research and Zhongyi for imagery; and Camelo the Sintek gio accessing Christina available; with imagery support satellite the ing ACKNOWLEDGMENTS. ieieadsailaayi r vial nteHradDtvreat Dataverse Harvard the learning machine on entire available HVGW8L the are for analysis scripts https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ R spatial them. and obtain and to Python how pipeline with on along details provide data we All and bound- available administrative publicly and are manufac- temperature, aries) brick precipitation, of protected wind, All scope facilities, PM2.5, health Environment. and areas, schools, scale of forests, the distributions, Department assess (population the to turing used from data directly geospatial proprietary requested other is be kilns is brick analysis can on the data and of Bangladesh remainder GPS of the (the Government so pipeline data. The kilns), learning replicable. this predicted machine with model the all reproducible of of be output coordinates our final not an the within will of provide detected longitude also steps kilns and We data certain of latitude locations a This means pixel to pipeline. which the image learning translating metadata, as machine geospatial such our its pipeline, in of “de-geo-identified” a used stripped share imagery is to the able of are we version However, proprietary. is which Availability. Data that pollution air in interpretation causal trends the result. secular strengthening this further for inclu- of kilns, are the controls brick Additionally, to that effects unrelated not. fixed kilns are kilns should year whereas embassy only of downwind, the particular pollution sion from air in downwind for are and affect responsible that off-season, not be should brick should the they Because upwind during March, periods. time and quality two November air the between across only kilns operate upwind kilns of number the in for Results Appendix 30 km). SI a 20 using and (identified 15, km in (12, 15 distances within given various kilns a in by upwind on kilns (represented kilns upwind season brick of brick upwind effect more the of during effect day difference-in-difference causal The the isolates observations. approach daily between autocorrelation and kilns); to unrelated tion n udai oyoil fpeiiain eprtr,adwn speed; wind and temperature, precipitation, δ season of brick polynomials the quadratic and ing kilns upwind of number indicator; the between March); interaction to the (November season production day on UpwindKilns where following the estimated We days. different regression: on See kilns upwind production. identify brick to in used S6 isolate seasonality Fig. (46, of to Appendix, pollution advantage direction SI of additional wind sources the confounding used other with have from 47), pollution that PM air studies downwind of recent effect on the other kilns to of similar effect is causal the estimate to .A Blackman and A. compliance regulatory 9. better for technologies observation earth Using Purdy, R. 8. .M io .Bnm,N ros ahn erigfrevrnetlmonitoring. environmental for learning Machine Brooks, N. Benami, E. Hino, M. 7. y n te itne n 45 and distances other and Results, h tegho hsapoc isi h oprsno asta vary that days of comparison the in lies approach this of strength The ersnsaya xdefc t oto o eua rnsi i pollu- air in trends secular for control (to effect fixed year a represents esi h rae hk ein Bangladesh. region, (2013). Dhaka Greater the in ters Bangladesh. Bangladesh. in areas kilns. brick Mexican laws. environmental of enforcement Sustain. d Y ; d (48). BrickSeason 8–8 (2018). 583–588 1, X stediyPM daily the is d . d β savco fcnrl acltdfrteU mas includ- Embassy US the for calculated controls of vector a is to.Plu.Res. Pollut. Atmos. esrstenme fuwn in eaiet h embassy the to relative kilns upwind of number the measures 3 Y h eet n ot fifra etrpluincontrol: pollution sector informal of costs and benefits The al., et pidKilns Upwind d = eue iiaGoe(o aa ehoois imagery, Technologies) Maxar (now DigitalGlobe used We β nio.Dv Econ. Dev. Environ. o iga fhwcagsi iddrcinwere direction wind in changes how of diagram a for to.Environ. Atmos. 0 d + sa niao hte h a al uigtebrick the during falls day the whether indicator an is egaeul cnweg iiaGoefrmak- for DigitalGlobe acknowledge gratefully We β 1 pidKilns Upwind 2.5  d d × 58 (2013). 75–86 4, ersnsteerrtr dutdfrserial for adjusted term error the represents esrda h SEbsyi Dhaka; in Embassy US the at measured BrickSeason .Evrn Law Environ. J. 0533 (2004). 3025–3038 38, 0–2 (2006). 603–627 11, ◦ https://doi.org/10.1073/pnas.2018863118 pidscosaepeetdi the in presented are sectors upwind d + i ult to.Health Atmos. Quality Air UpwindKilns d β 2 + rc Season Brick β 98 (2010). 59–87 22, θ 3 ◦ X nEq. in d pidae r shown are area upwind + δ y d 2.5 .W etdthe tested We 6). + × d  hsapproach This . PNAS BrickSeason + d , 357–365 6, | f10 of 9 Nat. d [6] is

COMPUTER SCIENCES SUSTAINABILITY SCIENCE 13. C. Weyant et al., Emissions from South Asian brick production. Environ. Sci. Technol. 31. World Health Organization, “Effects of air pollution on children’s health and devel- 48, 6477–6483 (2014). opment: A review of the evidence” (Technical Report EUR/05/5046027, WHO Regional 14. A. Eil, J. Li, P. Baral, E. Saikawa, “Dirty stacks, high stakes: An overview of brick sector Office for Europe, Copenhagen, 2005). in South Asia” (World Bank, 2020). 32. J. Currie, J. G. Zivin, J. Mullins, M. Neidell, What do we know about short- and long- 15. J. S. Apte, M. Brauer, A. J. Cohen, M. Ezzati, C. A. Pope, Ambient PM2.5 reduces term effects of early-life exposure to pollution? Annu. Rev. Res. Econ. 6, 217–247 global and regional life expectancy. Environ. Sci. Technol. Lett. 5, 546–551 (2014). (2018). 33. J. G. Zivin, M. Neidell, Environment, health, and human capital. J. Econ. Lit. 51, 689– 16. S. P. Luby, D. Biswas, E. S. Gurley, I. Hossain, Why highly polluting methods 730 (2013). are used to manufacture bricks in Bangladesh. Energy Sustain. Dev. 28, 68–74 34. K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale (2015). image recognition. arXiv [Preprint] (2015). https://arxiv.org/abs/1409.1556 (Accessed 17. World Bank, “Introducing energy-efficient clean technologies in the brick Sector of 9 October 2020). Bangladesh” (Technical report, IBRD/World Bank, Washington, DC, 2011). 35. C. Szegedy et al., Going deeper with convolutions. arXiv [Preprint] (2014). 18. Department of Environment, “Registered brick kilns” (DOE report, Government of https://arxiv.org/abs/1409.4842 (Accessed 9 October 2020). Bangladesh, Dhaka, Bangladesh, 2019). 36. F. Chollet, Xception: Deep learning with depthwise separable convolutions. arXiv 19. Department of Environment, “National strategy for sustainable brick production in [Preprint] (2016). https://arxiv.org/abs/1610.02357 (Accessed 9 October 2020). Bangladesh” (Technical report, Dhaka, Bangladesh, 2017). 37. K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition. arXiv 20. M. M. Rana, Sources of air pollution in Bangladesh: Brick kiln & vehicle emission [Preprint] (2015). https://arxiv.org/abs/1512.03385 (Accessed 9 October 2020). scenario (Technical report, Department of Environment, Bangladesh, 2019). 38. J. Deng et al., “ImageNet: A large-scale hierarchical image database” in 2009 IEEE 21. A. Kinyondo, R. Pelizzo, Poor quality of data in Africa: What are the issues?: Politics Conference on Computer Vision and Pattern Recognition (IEEE, 2009), pp. 248–255. and policy. Polit. Pol. 46, 851–877 (2018). 39. I. Radosavovic, P. Dollar,´ R. Girshick, G. Gkioxari, K. He, “Data distillation: Towards 22. U. Nazir, U. K. Mian, M. U. Sohail, M. Taj, M. Uppal, Kiln-net: A gated neural network omni-supervised learning” in 2018 IEEE Conference on Computer Vision and Pattern for detection of brick kilns in South Asia. IEEE J. Select. Topics Appl. Earth Observ. Recognition (IEEE, 2018), pp. 4119–4128. Remote Sens. 13, 3251–3262 (2020). 40. G. Wang, Humans in the loop: The design of interactive AI systems.J. Artificial 23. G. Foody, F. Ling, D. Boyd, X. Li, J. Wardlaw, Earth observation and machine learning Intelligence Res. 64, 243–252 (2019). to meet sustainable development goal 8.7: Mapping sites associated with slavery from 41. A. Shrivastava, A. Gupta, R. Girshick, “Training region-based object detectors with space. Rem. Sens. 11, 266 (2019). online hard example mining” in 2016 IEEE Conference on Computer Vision and 24. K. Ayush, B. Uzkent, M. Burke, D. Lobell, S. Ermon, Generating interpretable Pattern Recognition (IEEE, 2016), pp. 761–769. poverty maps using object detection in satellite images. arXiv [Preprint] (2020). 42. B. Zhou, A. Khosla, A.` Lapedriza, A. Oliva, A. Torralba, Learning deep features for https://arxiv.org/abs/2002.01612 (Accessed 9 October 2020). discriminative localization. arXiv [Preprint] (2015). https://arxiv.org/abs/1512.04150 25. T. Sarker, Who bears the burden of taxes in developing countries? A case (Accessed 9 October 2020). of income taxation in Bangladesh. Pakistan Econ. Soc. Rev. 44, 181–207 43. WorldPop, Global High Resolution Population Denominators Project. https://dx.doi. (2006). org/10.5258/SOTON/WP00674. Accessed 2 April 2021. 26. R. Rezaur, Road traffic accident situation in Khulna city, Bangladesh. Proc. Eastern 44. Humanitarian Data Exchange, Bangladesh. https://data.humdata.org/group/bgd. Asia Soc. Transport. Studies 5, 65–74 (2005). Accessed 2 April 2021. 27. M. Chowdhury, M. Koike, N. Muhammed, Embracing collaborative protected area 45. UNEP-WCMC, Protected area profile for Bangladesh from the World Database of management for conservation: An analysis of the development of the forest policy of Protected Areas, April 2021. https://www.protectedplanet.net/. Accessed 2 April 2021. Bangladesh. Int. For. Rev. 11, 359–374 (2009). 46. M. A. Rangel, T. S. Vogl, Agricultural fires and health at birth. Rev. Econ. Stat. 101, 28. M. Murshed, Cutting of hills and human tragedy: An example of law enforcement 616–630 (2019). loophole in the southeastern region of Bangladesh. Int. J. Geomatics Geosci. 4, 410 47. T. Deryugina, G. Heutel, NH. Miller, D. Molitor, J. Reif, The mortality and medical (2013). costs of air pollution: Evidence from changes in wind direction. Am. Econ. Rev. 109, 29. I. Ahmed et al., Compliance to building codes for disaster resilience: Bangladesh and 4178–4219 (2019). Nepal. Procedia Eng. 212, 986–93 (2018). 48. J. Lee, et al., Replication Data for: Scalable deep learning to identify brick kilns and 30. A. Ali, Food safety and public health issues in Bangladesh: A regulatory concern. Euro. aid regulatory capacity. Harvard Dataverse. https://doi.org/10.7910/DVN/HVGW8L. Food Feed Law Rev. 8, 31–40 (2013). Deposited 9 April 2021.

10 of 10 | PNAS Lee et al. https://doi.org/10.1073/pnas.2018863118 Scalable deep learning to identify brick kilns and aid regulatory capacity Downloaded by guest on October 1, 2021