Web traffic: analysis of navigation data and modeling at single user level.

Web traffic: analysis of navigation data and modeling at single user level. José Javier Ramasco. Santander, October 2006. Marta Sánchez La Lama

Outline: Internet and the Web; Navigation traces; Data analysis at an aggregate level; Individual-level data: navigation trees; Models of Web navigation

Internet and the WWW (Web)

Internet and the Web (Friendster.com)

Web navigation & navigation traces: http://www.a.edu, http://www.b.edu

Navigation traces

Navigation traces (Web requests)
Source MAC: 03:5a:66:17:0:5e
Dest. MAC: 10::1:3f:51:2f
Source IP: 12.168.3.10
Dest. IP: 127.100.251.3
Source Port: 421
Dest. Port: 80
GET /index.html HTTP/1.1
Agent: SuperCrawler-200/beta
Referer: http://www.grumpy-puppy.com/
Host: www.happy-kitty.com
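As a rough illustration of how a single captured request becomes one step of a navigation trace, the sketch below reads the request line plus the Referer and Host headers and returns the pair (referring host, requested host). The function name `parse_click` and the parsing approach are assumptions made for this example, not the pipeline used in the talk.

```python
from urllib.parse import urlsplit

def parse_click(request_text):
    """Return (referrer_host, target_host, path) from a raw HTTP request.

    Hypothetical helper: it only reads the request line and the
    Referer/Host headers, ignoring everything else in the packet.
    """
    lines = request_text.strip().splitlines()
    # Request line has the form: METHOD PATH VERSION
    _method, path, _version = lines[0].split()
    headers = {}
    for line in lines[1:]:
        # Split on the first colon only, so URLs in values stay intact.
        name, _, value = line.partition(":")
        headers[name.strip().lower()] = value.strip()
    referer = headers.get("referer")
    referrer_host = urlsplit(referer).hostname if referer else None
    return referrer_host, headers.get("host"), path

example = """GET /index.html HTTP/1.1
Agent: SuperCrawler-200/beta
Referer: http://www.grumpy-puppy.com/
Host: www.happy-kitty.com"""

click = parse_click(example)
print(click)
```

Each such pair can be read as a directed click from the referring host to the requested host. A request without a Referer header yields `None` as its source, which would mark the start of a new navigation path.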

Why study navigation traces?

Databases
Emory University. Students: 12,300. Faculty: ~3,200. Population of the metro area: 5.6 million.
Indiana University, Bloomington. Students: 42,000. Faculty: ~5,000. Population: 70 k.

Databases (Emory University). The database consists of the weblogs of Emory University from Apr. 1st 2005 to Jan. 17th 2006 (41 weeks). Each click on a university web page is recorded with a time resolution of 1 second.

Databases (Indiana University). The database consists of the Web requests from a dorm of the University, collected from March 5, 2008 through May 3, 2008: 408 million HTTP requests; 1083 unique MAC addresses (computers); 2.8 million page requests; 67 unique users; 630,000 Web servers; 110,000 referring hosts.

Aggregate results

Aggregate results (IP, www.x.emory.edu/*)

Individual user results

Individual user results (Sessions)

Models: PageRank
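For reference, the standard PageRank random-surfer model can be sketched as a power iteration: at each step the surfer follows a random outgoing link with probability `damping` and otherwise jumps to a uniformly random page. The toy graph `toy_web`, the damping value 0.85, and the iteration count are assumptions chosen for illustration. This is the textbook algorithm, not necessarily the exact variant used in the talk.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Power-iteration PageRank on a dict mapping page -> list of linked pages."""
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Teleportation: every page receives an equal baseline share.
        new_rank = {v: (1.0 - damping) / n for v in nodes}
        for v in nodes:
            targets = links.get(v, [])
            if targets:
                # Split this page's rank equally among its outgoing links.
                share = damping * rank[v] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its rank uniformly over all pages.
                for t in nodes:
                    new_rank[t] += damping * rank[v] / n
        rank = new_rank
    return rank

# Hypothetical three-page web used only for illustration.
toy_web = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(toy_web)
print(ranks)
```

In this toy graph page `c` receives links from both `a` and `b`, so it ends up with the highest rank, and the ranks sum to 1 because they form a probability distribution over pages.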

Models: BookRank

Models: bookmarks + topicality (ABC)

Simulation vs empirical data

Conclusions. We have studied the Web navigation traces of a large number of users. Some of the features seem to be relatively universal despite natural user-to-user variability. We have proposed a family of models able to reproduce deeper and deeper characteristics of the users' navigation patterns. How far should we go? Does this last simple model implement topicality satisfactorily? And what about real-time dynamics?

Collaborators & papers