DeepPIM: Deep Neural Point-of-Interest Imputation Model

A point-of-interest (POI) is a specific location in which someone is interested. Users on Instagram, a mobile-based social network, share their experiences with text and photos, and link POIs to their posts. POIs can be utilized to understand user preferences and behavior. However, POI information is not annotated in more than half of the total generated data. Therefore, it is necessary to automatically associate a post on Instagram with a POI. In a previous study, a POI prediction model was trained on the POI-annotated Instagram data which includes user, text, and photo information. However, this model has some limitations such as difficulty in handling a large amount of data and the high cost of feature engineering. In addition, this model does not utilize posting time information which provides each POI's temporal characteristics. In this paper, we propose a novel deep learning based time-aware POI prediction model that processes a large amount of data without feature engineering. Our proposed model utilizes text, photo, user, and time information to predict correct POIs. The experimental results show that our model significantly outperforms the existing state-of-the-art model.

Model description

DeepPIM consists of two DNN layers (a textual RNN layer and a visual CNN layer) and two latent feature matrices, one for user embedding and one for time embedding.
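To make the four inputs concrete, here is a minimal NumPy sketch of how a text feature, a visual feature, a user embedding, and a time embedding could be turned into a distribution over POIs. This README does not state how DeepPIM fuses these features, so the concatenation followed by a linear layer and softmax is an assumption made for illustration only. The function name `predict_poi`, all dimensions, and the use of the hour alone as the time index are hypothetical, and the random vectors stand in for the outputs of the RNN and CNN layers.

```python
import numpy as np


def softmax(scores):
    """Numerically stable softmax over a 1-D score vector."""
    shifted = scores - scores.max()
    exp = np.exp(shifted)
    return exp / exp.sum()


def predict_poi(text_vec, visual_vec, user_id, hour, user_emb, time_emb, W, b):
    """Return a probability for every POI for a single post.

    Assumption: features are fused by plain concatenation; the actual
    fusion used in train.py may differ.
    """
    joint = np.concatenate([
        text_vec,            # stand-in for the textual RNN output
        visual_vec,          # stand-in for the visual CNN output
        user_emb[user_id],   # row lookup in the user latent matrix
        time_emb[hour],      # row lookup in the time latent matrix
    ])
    return softmax(W.dot(joint) + b)


# Toy sizes, chosen only so the example runs quickly.
rng = np.random.RandomState(0)
n_users, n_hours, n_pois, dim = 5, 24, 7, 4
user_emb = rng.randn(n_users, dim)
time_emb = rng.randn(n_hours, dim)
W = rng.randn(n_pois, 4 * dim)
b = np.zeros(n_pois)

probs = predict_poi(rng.randn(dim), rng.randn(dim), 2, 13,
                    user_emb, time_emb, W, b)
predicted_poi = int(np.argmax(probs))
```

The embedding lookups are the part that corresponds most directly to the "latent feature matrices" above: each user and each time slot owns one trainable row.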

Data set

The data set is available here. It includes "train.txt", "validation.txt", "test.txt", and "visual_feature.npz". The "train.txt", "validation.txt", and "test.txt" files contain the training, validation, and testing data, respectively. The data is represented in the following format:

<post_id>\t<user_id>\t<word_1 word_2 ... >\t<poi_id>\t<month>\t<weekday>\t<hour>
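A record in this format can be split on tabs and then on spaces. The sketch below is one possible parser, not the code in utils.py. The names `Post` and `parse_post` are hypothetical, IDs are kept as strings, and it assumes that month, weekday, and hour are stored as integers.

```python
from collections import namedtuple

# Field order follows the format line above.
Post = namedtuple("Post", "post_id user_id words poi_id month weekday hour")


def parse_post(line):
    """Parse one tab-separated record into a Post tuple."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 7:
        raise ValueError("expected 7 tab-separated fields, got %d" % len(fields))
    post_id, user_id, words, poi_id, month, weekday, hour = fields
    return Post(
        post_id=post_id,
        user_id=user_id,
        words=words.split(),   # space-separated anonymized word ids
        poi_id=poi_id,
        month=int(month),      # assumption: integer-encoded
        weekday=int(weekday),  # assumption: integer-encoded
        hour=int(hour),        # assumption: integer-encoded
    )


# Made-up sample record, for illustration only.
sample = "p1\tu1\tw3 w7 w9\tpoi5\t6\t2\t14\n"
post = parse_post(sample)
```

Raising on a wrong field count makes malformed lines visible early instead of silently shifting columns.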

All post_id, user_id, word_id, and poi_id values are anonymized. The photos themselves cannot be distributed for privacy reasons, so we release converted visual features instead: the output of the FC-7 layer of VGGNet, which is used as the visual feature extractor. If you want to use another visual feature extractor, such as GoogleNet or ResNet, you can implement it in your own source code. We use a pre-trained VGGNet16 from https://github.com/machrisaa/tensorflow-vgg. The "visual_feature.npz" file contains the visual features, where the i-th row holds the i-th post's features.
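The feature file can be opened with NumPy. The array name stored inside "visual_feature.npz" is not given in this README, so the sketch below simply reads the first array in the archive; that choice, and the helper name `load_visual_features`, are assumptions. Since FC-7 of VGG16 has 4096 units, each row would be expected to have 4096 values.

```python
import numpy as np


def load_visual_features(path):
    """Load the visual feature matrix from an .npz archive.

    Assumption: the archive holds a single 2-D array whose i-th row
    belongs to the i-th post; the first stored array is returned.
    """
    archive = np.load(path)
    try:
        key = archive.files[0]          # name of the first stored array
        return np.asarray(archive[key])
    finally:
        archive.close()


def features_for_post(features, post_index):
    """Return the feature row for the post at the given row index."""
    return features[post_index]
```

Typical use would be `features = load_visual_features("visual_feature.npz")` followed by `features_for_post(features, i)`. Whether row indices are zero-based or equal to post_id is not stated here and should be checked against utils.py.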

Statistics

Number of posts	Number of POIs	Number of users	Size of vocabulary
736,445	9,745	14,830	470,374
Size of training set	Size of validation set	Size of test set
526,783	67,834	141,828
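The three split sizes add up to the total number of posts (526,783 + 67,834 + 141,828 = 736,445), which gives a quick way to check a download. The sketch below counts lines in each file and reports any split whose size differs from the table. It assumes one post per line, and the helper names are hypothetical.

```python
import os

# Sizes copied from the statistics table above.
EXPECTED_SIZES = {
    "train.txt": 526783,
    "validation.txt": 67834,
    "test.txt": 141828,
}
TOTAL_POSTS = 736445


def count_lines(path):
    """Count the lines in a text file without loading it into memory."""
    with open(path) as handle:
        return sum(1 for _ in handle)


def check_split_sizes(data_dir, expected=EXPECTED_SIZES):
    """Return {filename: (found, expected)} for every split that differs."""
    mismatches = {}
    for name, want in expected.items():
        found = count_lines(os.path.join(data_dir, name))
        if found != want:
            mismatches[name] = (found, want)
    return mismatches
```

An empty dictionary from `check_split_sizes("path/to/data")` means all three files match the published sizes.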

Getting Started

The code implementing our proposed model is written for the above data set, which includes pre-processed visual features. If you want to use raw images that are not pre-processed, implement VGGNet in your source code as the visual CNN layer.

Prerequisites

python 2.7
tensorflow r1.2.1

Usage

git clone https://github.com/qnfnwkd/DeepPIM
cd DeepPIM
python train.py
