CSV

The rawdata.csv file describes mobility patterns at the level of traffic analysis zones. We extract human trips from Call Detail Records (CDR) data. Using the traffic analysis zone dataset, we map each trip record to its origin zone and destination zone, which yields this dataset. It stores the hourly number of departure and arrival trips in each traffic analysis zone.
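The hourly aggregation described above can be sketched as follows. This is a minimal illustration only: the record layout (origin zone, destination zone, departure time, arrival time), the zone identifiers, and the function name `hourly_counts` are assumptions, not the actual schema of rawdata.csv.

```python
from collections import Counter
from datetime import datetime

# Hypothetical trip records already mapped to zones:
# (origin_zone, destination_zone, departure_time, arrival_time)
trips = [
    ("Z1", "Z2", "2019-06-01 08:15", "2019-06-01 08:50"),
    ("Z1", "Z3", "2019-06-01 08:40", "2019-06-01 09:10"),
    ("Z2", "Z1", "2019-06-01 09:05", "2019-06-01 09:30"),
]

def hourly_counts(trips):
    """Count departures and arrivals per (zone, hour) pair."""
    fmt = "%Y-%m-%d %H:%M"
    departures, arrivals = Counter(), Counter()
    for origin, destination, t_dep, t_arr in trips:
        # Truncate each timestamp to the start of its hour.
        dep_hour = datetime.strptime(t_dep, fmt).replace(minute=0)
        arr_hour = datetime.strptime(t_arr, fmt).replace(minute=0)
        departures[(origin, dep_hour)] += 1
        arrivals[(destination, arr_hour)] += 1
    return departures, arrivals

departures, arrivals = hourly_counts(trips)
```

Each counter is keyed by zone and hour, so one row of an hourly table corresponds to one key with its departure and arrival counts.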

The POI-importance.csv file gives the term frequency-inverse document frequency (TF-IDF) of each point-of-interest (POI) category in each traffic analysis zone.
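As a rough sketch of how such a score can be computed, the example below treats each zone as a "document" and each POI category as a "term", using the textbook variant tf = count / total POIs in the zone and idf = ln(number of zones / number of zones containing the category). The counts, zone names, and the function name `tf_idf` are hypothetical, and the dataset authors may have used a different TF-IDF weighting.

```python
import math

# Hypothetical POI counts per zone: zone -> {category: count}
poi_counts = {
    "Z1": {"restaurant": 8, "school": 2},
    "Z2": {"restaurant": 5, "hospital": 5},
    "Z3": {"restaurant": 1, "school": 9},
}

def tf_idf(poi_counts):
    """Return zone -> {category: TF-IDF score} (textbook variant)."""
    n_zones = len(poi_counts)
    # Document frequency: number of zones in which a category occurs.
    df = {}
    for counts in poi_counts.values():
        for category, count in counts.items():
            if count > 0:
                df[category] = df.get(category, 0) + 1
    scores = {}
    for zone, counts in poi_counts.items():
        total = sum(counts.values())
        scores[zone] = {
            category: (count / total) * math.log(n_zones / df[category])
            for category, count in counts.items()
            if count > 0
        }
    return scores

scores = tf_idf(poi_counts)
```

With this weighting, a category present in every zone (here "restaurant") scores zero everywhere, while a category concentrated in few zones scores higher where it occurs.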

  • Transportation
  • Last Updated On: 
    Thu, 06/06/2019 - 03:05

  • Communications
  • Last Updated On: 
    Wed, 06/05/2019 - 22:25

    We introduce a benchmark of distributed algorithm execution over big data. The datasets contain metrics on the computational impact (resource usage) of eleven well-known machine learning techniques on a real computational cluster, covering system resource agnostic indicators: CPU consumption, memory usage, operating system process load, network traffic, and I/O operations. The metrics were collected every five seconds for each algorithm at five different data volume scales, totaling 275 distinct datasets.

  • Standards Research Data
  • Last Updated On: 
    Thu, 06/06/2019 - 13:58

    Video dataset of 102 participants for the paper "Learning deep representations for video-based intake gesture detection"

  • Health
  • Last Updated On: 
    Thu, 05/16/2019 - 04:41

    Archival bundle of District Information System for Education (DISE) data for Delhi schools from primary to upper-primary level in the 2012-2013 academic session. DISE is a school-level dataset covering government-recognized schools. It is a joint initiative of the Government of India, UNICEF, and the National University of Educational Planning and Administration (NUEPA).

  • Education
  • Last Updated On: 
    Sat, 04/06/2019 - 01:33

    MATLAB Simulink was used to develop an emulator for the Viessmann Vitorond 200 Gas Fired Boiler VD2 Series 380. A series of faults was modeled, along with normal data across the expected range of operation, to create a labelled dataset of approximately 27,500 cases for training and testing boiler fault classification models.

  • Energy
  • Last Updated On: 
    Sat, 03/30/2019 - 15:08

    Empirical line methods (ELM) are frequently used to correct images from aerial remote sensing. Remote sensing of aquatic environments captures only a small amount of energy because the water absorbs much of it, so the signal response of water is proportionally smaller than that of other land surface targets.

    This dataset presents some resources and results of a new approach to calibrating empirical lines that combines reference calibration panels with water samples. We optimize the method using Python algorithms until it reaches the best result.
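For readers unfamiliar with the technique, an empirical line is a linear relation between the raw image values over reference targets and their known reflectance. The sketch below fits that line by ordinary least squares. The numbers are made up for illustration, the function name `fit_empirical_line` is hypothetical, and it does not reproduce the optimization procedure of the dataset authors.

```python
def fit_empirical_line(digital_numbers, reflectances):
    """Least-squares fit of reflectance = gain * DN + offset."""
    n = len(digital_numbers)
    mean_x = sum(digital_numbers) / n
    mean_y = sum(reflectances) / n
    sxx = sum((x - mean_x) ** 2 for x in digital_numbers)
    sxy = sum(
        (x - mean_x) * (y - mean_y)
        for x, y in zip(digital_numbers, reflectances)
    )
    gain = sxy / sxx
    offset = mean_y - gain * mean_x
    return gain, offset

# Hypothetical reference targets: raw image values (DN) and the
# reflectance measured for each target on the ground.
panel_dn = [50.0, 120.0, 200.0]
panel_reflectance = [0.05, 0.225, 0.425]

gain, offset = fit_empirical_line(panel_dn, panel_reflectance)

def to_reflectance(dn):
    """Apply the fitted line to a raw image value."""
    return gain * dn + offset
```

Under this reading, water samples would simply contribute additional low-reflectance points to the same fit, which is an assumption about how the two sources are combined.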

     

  • Sensors
  • Last Updated On: 
    Fri, 03/08/2019 - 20:47

    The dataset is used in the paper entitled "A distributed Fog node assessment model by using Fuzzy rules learned by XGBoost" and contains the fuzzy rules extracted by XGBoost.

  • Communications
  • Last Updated On: 
    Sun, 03/03/2019 - 06:46

    This is an artificial imbalanced data set.

  • Standards Research Data
  • Last Updated On: 
    Thu, 01/10/2019 - 11:45

    The dataset is used by the machine learning method in the paper "A distributed Front-end Edge node assessment model by using Fuzzy and a learning-to-rank method".

  • Communications
  • Last Updated On: 
    Mon, 01/07/2019 - 19:06
