Estimation of Household Waste in the Republic of Serbia using R software

Similar documents
EU PRO 2018/2019 Calendar Competition What could Serbia bring to the EU?

Perspectives of Revitalisation of High Nature Area Through Organic Apiculture

REGIONAL ASPECTS OF AGRICULTURAL INCOME LEVEL IN VOJVODINA PROVINCE IN FUNCTION OF BASIC PRODUCTION FACTORS

Solid waste generation and disposal by Hotels in Coimbatore City

SERBIA FLOODS 2014 Worst flooding in over a century

SIM Selection and peer-review under responsibility of SIM 2013 / 12th International Symposium in Management.

DEVELOPMENT OF PHARMACEUTICALS SECTOR IN SERBIA FOLLOWING WORLD TRENDS TARIFFS ELIMINATION

Big Data Processing using Parallelism Techniques Shazia Zaman MSDS 7333 Quantifying the World, 4/20/2017

Evaluation of realized investments in Belgrade s and Danube region

Discriminate Analysis of Synthetic Vision System Equivalent Safety Metric 4 (SVS-ESM-4)

Impact of Financial Sector on Economic Growth: Evidence from Kosovo

Vera Zelenović. University of Novi Sad, Novi Sad, Serbia. Dragan Lukač. Regional Chamber of Commerce Novi Sad, Novi Sad, Serbia

International Journal of Science Vol.4 No ISSN:

WHEN IS THE RIGHT TIME TO FLY? THE CASE OF SOUTHEAST ASIAN LOW- COST AIRLINES

Quarterly sample surveys on holiday and business trips

Statistical Evaluation of Seasonal Effects to Income, Sales and Work- Ocupation of Farmers, the Apples Case in Prizren and Korça Regions

Models for the statistical analysis of trends in rural tourism activity in Romania

An Econometric Study of Flight Delay Causes at O Hare International Airport Nathan Daniel Boettcher, Dr. Don Thompson*

Tourist Traffic in the City of Rijeka For the Period Between 2004 and 2014

GEOGRAPHY OF GLACIERS 2

An Analysis Of Characteristics Of U.S. Hotels Based On Upper And Lower Quartile Net Operating Income

Estimating Sources of Temporal Deviations from Flight Plans

NAPA VALLEY VISITOR INDUSTRY 2012 Economic Impact Report

Controlled Cooking Test (CCT)

Airspace Complexity Measurement: An Air Traffic Control Simulation Analysis

SOME MOTIVATIONAL FACTORS THAT DETERMINE ROMANIAN PEOPLE TO CHOOSE CERTAIN TRAVEL PACKAGES

Validation of Runway Capacity Models

Mathcad Prime 3.0. Curriculum Guide

Cross-sectional time-series analysis of airspace capacity in Europe

THE DISINTEGRATION OF SETTLEMENTS IN BOSNIA AND HERZEGOVINA THE EXAMPLE OF SARAJEVO/EAST SARAJEVO

ACCOMMODATION from the supply side

Daily Estimation of Passenger Flow in Large and Complicated Urban Railway Network. Shuichi Myojo. Railway Technical Research Institute, Tokyo, Japan

FORECASTING OF INDUSTRIAL ROUNDWOOD PRODUCTION FOR THE PART OF SOUTH-EAST EUROPE. Maja Moro, Darko Motik, Denis Jelačić, Marek Drimal

Impact Evaluation of a Cluster Program: An Application of Synthetic Control Methods. Diego Aboal*, Gustavo Crespi** and Marcelo Perera* *CINVE **IDB

Tourist flow in Italy Year 2016

NAPA VALLEY VISITOR INDUSTRY 2014 Economic Impact Report

Marian ZAHARIA Petroleum-Gas University, Ploiesti, Romania

Temporal Deviations from Flight Plans:

Global Aerospace & Defense Market Report

Tourist flow in Italy Year 2017

Thinking With Mathematical Models Invs. 4.3, Correlation Coefficients & Outliers. HW ACE #4 (6-9) starts on page 96

City tourism: a successful product

Somchanok Tiabtiamrat* and Supachok Wiriyacosol ABSTRACT

CORRIDORS: DEVELOPMENT OPPORTUNITY OF SERBIA*

LUCRĂRI ŞTIINŢIFICE, SERIA I, VOL. XIII (4) ECONOMIC STATISTICS OF SPECIFIC RESOURCES AND ACTIVITIES OF RURAL TOURISM

Predicting Flight Delays Using Data Mining Techniques

Modeling Air Passenger Demand in Bandaranaike International Airport, Sri Lanka

Baku, Azerbaijan November th, 2011

SERBIA BOSILEGRAD. Theme covered: Effective participation Affected minorities: Bulgarians

Demand Shifting across Flights and Airports in a Spatial Competition Model

THE INFLUENCE OF TRANSIT TOURISTS TOWARDS THE DEVELOPMENT OF HOSPITALITY IN THE SOUTHEASTERN REGION

Tari s versus Non-Tari Barriers

Abstract. Introduction

Original scientific paper UDC: 911.2:551.58(497.11) DOI: /IJGI S ANALYSIS OF ANNUAL SUMS OF PRECIPITATION IN SERBIA

VISIT BALTIMORE TOURISM WORKFORCE STUDY

PREFERENCES FOR NIGERIAN DOMESTIC PASSENGER AIRLINE INDUSTRY: A CONJOINT ANALYSIS

Concept, Method, Challenges. Jürgen Weiß. MA in Tourism Economics (University of applied Sciences Vienna)

An Analysis of Intra-Regional Air Travel in SAARC Region

Forecasting effects of weather extremes: El Nino s influence maize yields in Mexico

AIR FORCE INSTITUTE OF TECHNOLOGY

Ecomuseums and sustainable tourism: experiences of the Piedmont Region (I)

PRESS RELEASE No. 24 of February 3, 2014 Tourism December and the Year 2013

Capacity Building Programme on Tourism Statistics Workshop IV Vienna (AT), 18-20/11/2009

Impacts of Visitor Spending on the Local Economy: George Washington Birthplace National Monument, 2004

NAPA VALLEY VISITOR INDUSTRY 2016 Economic Impact Report

THE PERFORMANCE OF DUBLIN AIRPORT:

The Effects of GPS and Moving Map Displays on Pilot Navigational Awareness While Flying Under VFR

Economic Impact of Kalamazoo-Battle Creek International Airport

Analysis of Transit Fare Evasion in the Rose Quarter

Improving Statistical Capacities of Tourism in Turkey

THE FESTIVALS AS A TOOL ON OHRID TOURISM DESTINATION BRANDING

The Importance of Promoting a Rural Touristic Destination: The Case of Racoş Village

Introduction on the Tourism Satellite Account

Uncertainty in the demand for Australian tourism

Glacial lakes as sentinels of climate change in Central Himalaya, Nepal

Wake Turbulence Research Modeling

Newbury Population ,071 1,941 3,719 4,038 5,337 5,611 5,805

Revisiting the Relationship between Competition and Price Discrimination

Tourist motives and activities as drivers of tourist satisfaction among men and women

To Study the Relationship between Service Quality Tourist Satisfaction and Revisit Intension

The Impact of Bankruptcy on Airline Service Levels

REPORT. VisitEngland Business Confidence Monitor Wave 5 Autumn

ALLOMETRY: DETERMING IF DOLPHINS ARE SMARTER THAN HUMANS?

UNDERSTANDING TOURISM: BASIC GLOSSARY 1

IMPORTANT - PERFORMANCE ANALYSIS AS A TOOL IN DESTINATION MARKETING

MIT ICAT. Fares and Competition in US Markets: Changes in Fares and Demand Since Peter Belobaba Celian Geslin Nikolaos Pyrgiotis

Survey into foreign visitors to Tallinn Target market: Cruise voyagers. TNS Emor March 2012

Mapping the Snout. Subjects. Skills. Materials

Seasonal Adjustment with the R packages x12 and x12gui

2011 Census of Population, Households and Dwellings in the Republic of Serbia. Press conference 29 November 2012

WORLD TOURISM ORGANIZATION (UNWTO) Final Report WORKSHOP ON TOURISM STATISTICS (SESRIC-TR MINISTRY OF CULTURE AND TOURISM-TURKSTAT-UNWTO)

Available online at ScienceDirect. Procedia Economics and Finance 6 ( 2013 )

ABOUT SRBOBRAN. No of Companies in Srbobran. No of Entrepreneurs in Srbobran

2011 Census of Population, Households and Dwellings in the Republic of Serbia DISABILITY. Briefing

VUSALA EYNULLAYEVA 2368 MANAGEMENT CONSULTING LABS FALL 2016

Estimates of the Economic Importance of Tourism

Simulation of disturbances and modelling of expected train passenger delays

MEASUREMENT OF ACCELERATION Pre-Lab. Name: Roster#

Queenstown Lakes District Council. Review of District Plan Business Zones Capacity and Development of Zoning Hierarchy

Predicting a Dramatic Contraction in the 10-Year Passenger Demand

Transcription:

Estimation of Household Waste in the Republic of Serbia using R software Melinda TOKAI (melinda.tokai@stat.gov.rs) Statistical Of ce of the Republic of Serbia ABSTRACT This paper deals with the problem of estimation of annual amount of waste generated by households in Republic of Serbia. Waste generated by households is a part of municipal waste that also includes waste generated by trade and services activities as well as by tourists. In order to estimate pure household waste, regression analysis was preformed with reference to Cammarota et al. (2005) A proposal for the estimation of household waste. In order to face this problem, regression models were constructed for municipal waste that are based on non domestic variables which are related to trade and services activities and tourism. The part of the municipal waste that could not be explained by a model based on non domestic variables was ascribed to pure household waste. In order to check validity of results, the model residuals were then related to domestic variables (usual population and the average number of inhabitants per occupied dwelling). The regression models were tted using R software. Keywords: Household waste, regression models, R software JEL Classi cation: C10, C88 INTRODUCTION The aim of this paper is to present the way household waste was estimated for reference year 2014 in SORS (Statistical Of ce of the Republic of Serbia). The need for this estimation came out of the Eurostat s demand to provide data on household waste amount in Serbia, so this was a rst time in SORS that someone dealt with this issue with a notable time constrain. The key problem when estimating household waste in Serbia is that household waste is not directly detectable because municipal waste contains an unidenti ed amount of waste not generated by households. This paper presents the method how municipality waste was modeled with variables that are related to trade and services activities and tourism through linear regression. The part of municipality waste that could not be explained with this model is considered to be the pure household waste. Romanian Statistical Review nr. 2 / 2016 59

The collection system of municipal waste in Serbia is organized over the municipal territory with waste containers available to everyone with no mean to distinguish between different possible users (trade and services activities located near or in residential buildings). These containers are located along streets and near residential areas. Then municipalities report the entire amount of waste collected including both household waste and waste coming from trade and service activities. Moreover, tourists who spend their holydays in given cities can put their waste in the same containers. In order to face this problem, a simple regression model was set to estimate pure household waste starting from selected variables correlated with municipal waste. Models are tted including non domestic variables; the outcoming residuals are then related to domestic variables in order to check results validity. METHOD Based on available data, we tried to identify a model which would allow estimation of the municipal waste amount that can be assigned uniquely to households. For this purpose we consider variables representing the most relevant sources of municipal waste, distinguishing them into non domestic and domestic variables. The non domestic variables are related to trade and services activities and tourism. We represented trade and services activities with the number of trade and services employees per inhabitant. As far as the tourism is concerned, it is represented by the ratio of tourist overnights stays and usual population. Two variables are used to account for domestic waste s sources: usual population and the average number of inhabitants per occupied dwelling. The second indicator allows accounting for people not registered as usual residents but living in occupied dwellings. This last indicator is built on data coming from the Census of population in Serbia, in which information on occupied and unoccupied dwellings are collected. The part of the municipal waste that could not be explained by models based on non domestic variables can be ascribed to pure household waste. As a validity check, the model residuals are than related to domestic variables. Analyzing the data which were aggregated on a county level, it has come to our attention that the recorded data for the City of Belgrade differs signi cantly (see bar plot on Figure 1) from the rest of the data. Belgrade s contribution in the total value of the observed variable (municipality waste) is nearly 25% (see pie chart on Figure 2), which is much more then for the other counties. Because of this, we decided to create two regression models for the estimation of the household waste: one for all the counties except the City of Belgrade and one for the Belgrade municipalities. 60 Romanian Statistical Review nr. 2 / 2016

par(mar = c(4, 4, 1, 1), mgp = c(2, 1, 0), cex = 0.6) barplot(okrug$kom.3/1000, col='skyblue',names.arg = okrug$okr, ylab='municipality waste in \n thousands of tones', ylim=c(0,600), xlab='municipality code',) Municipality waste amount Figure 1 municipality waste in thousands of tones 0 100 300 500 00 02 04 06 08 10 12 14 16 18 20 22 24 municipality code par(mar = c(1,1,5, 1), mgp = c(2, 1, 0), cex = 0.8) pie(x=okrug$kom.3,init.angle=90,clockwise = T,labels=nazokr,cex=0.6, col=gray.colors(25,0.99,0.00001),lty=1, border='green4',lwd=6) Pie chart of municipality waste amount NISAVSKI PCINJSKI JABLANICKI PIROTSKI TOPLICKI GRAD BEOGRAD Figure 2 RASINSKI RASKI MORAVICKI ZLATIBORSKI SEVERNO BACKI ZAJECARSKI BORSKI POMORAVSKI SUMADIJSKI BRANICEVSKI PODUNAVSKI KOLUBARSKI MACVANSKI SREMSKI SREDNJE BANATSKI SEVERNO BANATSKI JUZNO BANATSKI ZAPADNO BACKI JUZNO BACKI Romanian Statistical Review nr. 2 / 2016 61

ESTIMATION PROCEDURE As mentioned before, separate linear regression models were tted for Serbian counties excluding the City of Belgrade and another for the Belgrade municipalities. More precisely, the weighted least squares method for tting the best linear regression models was used. For this purpose the lm() function in the stats R package with a given weights argument was used. The rst model for all the counties except Belgrade is given by: logy i = α+β 1 x 1i +β 2 x 2i +ε i i = 1,...,n (1) where n is the number of counties (except Belgrade) (n = 24), y i is the amount of collected municipal waste in the counties (in tones), x 1i is the number of trade and service employees per inhabitant and x 2i is the ratio of tourist overnights stays and usual population. The model summary is in the Table 1. okrug1=okrug[!okrug$nokr%in% "GRAD BEOGRAD",] row.names(okrug1)=1:24 # linear model for Serbian counties excluding Belgrade srb_bezbg=lm(log(kom.3)~i(zapos^2/br.stan)+i(nocenja/br.stan), data=okrug1, weights=okrug1$br.stan/sum(okrug1$br.stan)) Regression summary for model (1): municipality waste amount for Serbian counties without Belgrade Table 1 Model (1) Coef cients Estimate Standard Error t-value ˆ 1.084e+01 8.792e-02 123.311 ˆ1 6.610e-05 7.451e-06 8.871 ˆ2 6.758e-03 6.052e-02 0.112 Residual standard error = 0.05154 on 21 degrees of freedom Multiple R-Squared=0.791 Once the linear model was tted, the pure household waste was estimated as: e logy i ˆβ 1 x 1i ˆβ 2 x 2i For most of the counties the estimated pure household waste is approximately 70% of the municipal waste, which is in accordance with the estimates of the neighboring countries with similar economic situation where 62 Romanian Statistical Review nr. 2 / 2016

pure household waste makes 60-85% of the total municipal waste. According to this model, for some counties the resulting estimate of the part of the municipality waste that can be ascribed to pure household waste is notably below (< 20%) or above (> 90%) the expected value. This is probably due to de ciencies of the data on which the model was based. For these counties imputation was applied, instead of the resulting estimates, the average of all the estimated values was used. #average of the part of the municipality waste that can be ascribed #to pure household (in all the counties except Belgrade) sum(exp(residuals(srb_bezbg)+coef(srb_bezbg)[1]))/sum(okrug1$kom.3) ## [1] 0.7025566 #part of the municipality waste that can be ascribed to pure household exp(residuals(srb_bezbg)+coef(srb_bezbg)[1])/okrug1$kom.3 ## 1 2 3 4 5 6 7 ## 0.7461787 0.8403478 0.8572623 0.7684268 0.8461123 0.1908959 0.7529099 ## 8 9 10 11 12 13 14 ## 0.7797441 0.8220857 0.8738667 0.8425341 0.7018723 0.8183730 0.8776869 ## 15 16 17 18 19 20 21 ## 0.8827785 0.7465805 0.7788016 0.7549611 0.8415400 0.5775235 0.9291225 ## 22 23 24 ## 0.9282892 0.8514118 0.8671652 As it can be seen from the previus output, results are somewhat suspicious for couinties Juzno Backi(6), Nisavski(20), Toplicki(21), Pirotski(22). There is were imputation come in handy. #average of the part of the municipality waste that can be ascribed #to pure household waste(in all the counties except Belgrade) a=sum(exp(residuals(srb_bezbg)+coef(srb_bezbg)[1]))/sum(okrug1$kom.3) #part of the municipality waste that can be ascribed to pure household waste srb_bezbg_hw=exp(residuals(srb_bezbg)+coef(srb_bezbg)[1])/okrug1$kom.3 srb_bezbg_hw[c(6,20,21,22)]=a #part of the municipality waste that can be ascribed to pure #household waste after imputation srb_bezbg_hw ## 1 2 3 4 5 6 7 ## 0.7461787 0.8403478 0.8572623 0.7684268 0.8461123 0.7025566 0.7529099 ## 8 9 10 11 12 13 14 ## 0.7797441 0.8220857 0.8738667 0.8425341 0.7018723 0.8183730 0.8776869 ## 15 16 17 18 19 20 21 ## 0.8827785 0.7465805 0.7788016 0.7549611 0.8415400 0.7025566 0.7025566 ## 22 23 24 ## 0.7025566 0.8514118 0.8671652 Romanian Statistical Review nr. 2 / 2016 63

For the City of Belgrade among all the tted models the municipal waste was best estimated with the linear regression model given by: logy i = a+β 3 x 3i +β 4 x 4i +e i i = 1,...,n (2) where n is the number of Belgrade municipalities (n = 17), y i is the amount of collected municipal waste (in tones), x 3i is the number of trade and service employees and x 4i is the number of tourist overnights stays. The model summary is in Table 2. #linear model for the municipalities of Belgrade Bg=lm(log(kom.3)~zapos+nocenja, data=beograd, weights=beograd$br.stan/sum(beograd$br.stan)) # the estimated part of the municipality waste in Belgrade that can be #ascribed to the pure household waste (bg_hw=sum(exp(residuals(bg)+coef(bg)[1]))/sum(beograd$kom.3)) ## [1] 0.6914103 Regression summary for model (2): municipality waste amount of Belgrade munici- pality Table 2 Model (2) Coef cients Estimate Standard Error t-value â 1.006e+01 1.813e-01 55.457 ˆ3 ˆ4 3.268e-05 8.716e-06 3.750-6.274e-06 2.333e-06-2.689 Residual standard error = 0.1072 on 14 degrees of freedom Multiple R-Squared=0.5222 The pure household waste is estimated as e logy i ˆβ 3 x 3i ˆβ 4 x 4i The estimated pure household waste in the City of Belgrade is approximately 69% of the municipal waste. We came to a conclusion that from 302 kilos of municipal waste per inhabitant, 76% is pure household waste that is 230 kilos. For the City of Belgrade this is a bit smaller 69% which can be explained with a large number of tourist overnights stays and the large number of trade and service employees. 64 Romanian Statistical Review nr. 2 / 2016

VALIDITY CHECK In order to validate the proposed models, we created another linear regression model where the response variable is the estimated amount of pure household waste and the predictors are the average number of inhabitants per occupied dwelling and the usual population. We should stress it here that the validation of the results was done on the county level with the full data (Belgrade is included as a county). rezultat=data.frame(okrug[,c(1,2,4,5,6,8,9)],koef=c(bg_hw,srb_bezbg_hw)) rezultat$domaci=rezultat$koef*rezultat$agen.otpad colnames(rezultat) ## [1] "okr" "nokr" "br.stan" "zapos" "nocenja" ## [6] "kom.otpad" "agen.otpad" "koef" "domaci" par(mar = c(4, 4, 4, 1), mgp = c(2, 1, 0), cex = 0.8) plot((rezultat$br.stan/1000),(rezultat$domaci/1000), xlab='usual population in thousands', ylab='household waste in thousands of tones') text(rezultat$br.stan/1000,rezultat$domaci/1000, labels=naz,cex=1,pos=2) Scatter plot of the relationship between household waste and number of usual popu- lation by counties Figure 3 household waste in thousands of tones 50 100 150 200 250 300 350 JUZNO BACKI GRAD BEOGRAD 500 1000 1500 usual population in thousands par(mar = c(4, 4, 4, 1), mgp = c(2, 1, 0), cex = 0.8) plot(rezultat$br.stan[-c(1,7)]/1000,rezultat$domaci[-c(1,7)]/1000, xlab='usual population in thousands', ylab='household waste in thousands of tones') Romanian Statistical Review nr. 2 / 2016 65

From the plots on Figure 3 and Figure 4 it is obvious that a very strong linear relationship exist between the household waste amount and the usual population. The regression model is given by: z i = b+β 5 x 5i +β 5 x 5i +ξ i i = 1,...,n (3) where z i is the estimated amount of pure household waste (in tones), x 5i is the average number of inhabitants per dwelling and x 6i is the usual population. This model s summary is in Table 3. Scatter plot of the relationship between household waste and number of usual population by counties (a closer look - without City of Belgrade and Juzna-Backa) Figure 4 household waste in thousands of tones 20 30 40 50 60 70 80 100 150 200 250 300 350 usual population in thousands Regression summary for model (3): valdity check Coef cients Estimate Standard Error t-value ˆb -4.823e+03 5.721e+03-0.4083 ˆ5 ˆ6 4.022e+03 1.905e+03 2.111 2.046e-01 1.989e-03 102.886 Residual standard error = 0. 3013 on 22 degrees of freedom Multiple R-Squared=0. 9979 Table 3 The negative sign of the intercept in model (3) is most likely due to the wide range of the response variable. 66 Romanian Statistical Review nr. 2 / 2016

CONCLUSION Due to Serbian collection system of municipal waste information data related to pure house- hold waste in not available. Starting from the total amount of municipal waste in the Serbian counties we presented a simple procedure to estimate the pure household waste considering only the selected non domestic variables. Note, that this was the rst time for SORS to conduct this kind of estimation and this was the best we could come up with in the given time, but there are plans in the future to continue model development. References 1. Cammarota M., Jona Lasinio G., Di Sarro T., 2005, A proposal for the estima- tion of household waste, Atti del Convergno intermedio SIS 2005, Statistica e ambiente, Messina, 21-23rd September 2005, pp. 215-218 2. Cammarota M., Jona Lasinio G., Di Sarro T., 2006, Methods for the Analysis and Estimation of Household Waste 3. R Core Team, 2015, R: A language and environment for statistical computing, R Foun- dation for Statistical Computing, Vienna, Austria. https://www.r-project.org/ Appendix library(xlsx) otpad=read.xlsx("../data/komunalni(2).xls", sheetindex=1, colindex=c(1,2,3,4,6), startrow=2, endrow=206, stringsasfactors=f) colnames(otpad)=c('mbops','opstina', 'kom.otpad','br.stan','agen.otpad') otpad$kom.otpad=as.numeric(otpad$kom.otpad) otpad$kom.otpad=round(otpad$kom.otpad) otpad=otpad[substr(otpad$mbops, 1,2)!='RS',] otpad=otpad[substr(otpad$opstina,1,4)!='grad',] #identifikacije okruga library(sas7bdat) sifrarnik=read.sas7bdat('../data/ops_2014.sas7bdat') sifrarnik$okr=as.character(sifrarnik$okr) sifrarnik$okr=as.character(sifrarnik$okr) sifrarnik$nokr=as.character(sifrarnik$nokr) sifrarnik=sifrarnik[c(1:122, 124:169),c(1,5,6)] #dodajemo id. okr otpad=merge(otpad,sifrarnik,by='mbops') okr=c(paste0(0, as.character(0:9)), as.character(10:24)) nokr=c() for(i in c(paste0(0, as.character(0:9)), as.character(10:24))){ nokr=c(nokr, sifrarnik$nokr[min(which(sifrarnik$okr==i))]) } # agregimo podatke na nivou okruga okrug=data.frame(cbind(okr, nokr)) for(i in 1:25){ okrug$br.stan[i]=sum(otpad$br.stan[otpad$okr==okr[i]]) okrug$kom.otpad[i]=sum(otpad$kom.otpad[otpad$okr==okr[i]]) } Romanian Statistical Review nr. 2 / 2016 67

###################################################################### #prosecna kolicina otpada po okruzima okrug$koef=okrug$kom.otpad/okrug$br.stan otpad2=merge(otpad, okrug[,c(1,5)], by='okr') otpad2$kom.2=otpad2$koef*otpad2$br.stan #mbops,brzap, nocenja pomocni=read.xlsx("../data/otpad_modelnov_2014.xls", sheetindex=2, colindex=c(1,3,6),endrow=169) otpad2=merge(otpad2, pomocni, by='mbops') library(data.table) setnames(otpad2, 'totbr_zaposlenih','zapos') setnames(otpad2,'br_nocenja_turista','nocenja') otpad2=otpad2[,c(1,3,2,7,6,4,8,5,10,11,9)] ###################################################################### #imputacije zbog NA u nocenju# okrug$min=sapply(okrug$okr, function(x) return(min(otpad2$nocenja[otpad2$okr==x], na.rm=t))) otpad2=merge(otpad2,okrug[,c(1,6)], by='okr') otpad2$nocenja[is.na(otpad2$nocenja)]=otpad2$min[is.na(otpad2$nocenja)] #u Toplickom okrugu samo je za jednu opstinu dostupan podatak #(Kursumlija - Djavolja varos) otpad2$nocenja[otpad2$okr==21 & otpad2$mbops!='70688']=min(okrug$min) otpad2=otpad2[,c(2,3,1,4,5,6,7,8,9,10,11,12)] save(otpad2,file="otpad.rda") ###################################################################### okrug$agen.otpad=sapply(okrug$okr, function(x) return(sum(otpad2$agen.otpad[otpad2$okr==x]))) okrug$zapos=sapply(okrug$okr, function(x) return(sum(otpad2$zapos[otpad2$okr==x]))) okrug$nocenja=sapply(okrug$okr, function(x) return(sum(otpad2$nocenja[otpad2$okr==x]))) okrug$kom.2=sapply(okrug$okr, function(x) return(sum(otpad2$kom.2[otpad2$okr==x]))) okrug=okrug[,c(1,2,5,3,8,9,6,4,7,10)] okrug$a.otpad.stan=okrug$agen.otpad/okrug$br.stan okrug$kom.3=((okrug$koef+okrug$a.otpad.stan)/2*okrug$br.stan) save(okrug,file="okrug.rda") ##################################################################### okrug1=okrug[-1,] ####SRB BEZ BG srb_bezbg=lm(log(kom.3)~i(zapos^2/br.stan)+i(nocenja/br.stan), data=okrug1, weights=okrug1$br.stan/sum(okrug1$br.stan)) summary(srb_bezbg) sum(exp(residuals(srb_bezbg)+coef(srb_bezbg)[1]))/sum(okrug1$kom.3) exp(residuals(srb_bezbg)+coef(srb_bezbg)[1])/okrug1$kom.3 68 Romanian Statistical Review nr. 2 / 2016

#za 6,20,21,22 uzimam prosek (a=sum(exp(residuals(srb_bezbg)+coef(srb_bezbg)[1]))/sum(okrug1$kom.3)) (srb_bezbg_hw=exp(residuals(srb_bezbg)+coef(srb_bezbg)[1])/okrug1$kom.3) srb_bezbg_hw[c(6,20,21,22)]=a srb_bezbg_hw ### Beogradske opstine Beograd=otpad2[otpad2$okr=='00',] save(beograd,file="beograd.rda") Beograd$kom.3=((Beograd$koef+Beograd$agen.otpad/Beograd$br.stan)/2)*Beograd$br.stan Bg=lm(log(kom.3)~zapos+nocenja, data=beograd, weights=beograd$br.stan/sum(beograd$br.stan)) summary(bg) (bg_hw=sum(exp(residuals(bg)+coef(bg)[1]))/sum(beograd$kom.3)) ######################################################################## # rezultat rezultat=data.frame(okrug[,c(1,2,4,5,6,8,9)],koef=c(bg_hw,srb_bezbg_hw)) write.xlsx(rezultat, 'C:/Melinda/otpad/v_2/rezultati.xls', showna=f, sheetname='rezultat',col.names=t, row.names=f) rezultat$domaci=rezultat$koef*rezultat$agen.otpad # plot za odnos za proveru rezultata naz=c("grad BEOGRAD", rep("",5),"juzno-backi", rep("",18)) plot(rezultat$domaci~rezultat$br.stan, main='odnos domacinskog otpada i broja stanovnika \n po okruzima', xlab='broj stanovnika', ylab='kolicina otpada iz domacinstava') text(rezultat$br.stan,rezultat$domaci, labels=naz,cex=0.5,pos=2) plot(rezultat$domaci[-c(1,7)]~rezultat$br.stan[-c(1,7)], main='odnos domacinskog otpada i broja stanovnika \n po okruzima', sub='kada se Uklone Beograd i Juzna Backa', xlab='broj stanovnika', ylab='kolicina otpada iz domacinstava') summary(lm(domaci~br.stan, data=rezultat)) plot(residuals(lm(domaci~br.stan, data=rezultat))) rezultat$dom=rezultat[,3]/rezultat[,10] save(rezultat, file="rezultat.rda") summary(lm(domaci~dom+br.stan, data=rezultat)) plot(residuals(lm(domaci~dom+br.stan, data=rezultat))) plot(rezultat$domaci~rezultat$dom) Romanian Statistical Review nr. 2 / 2016 69