Code for "Retrieving Black-box Optimal Images from External Databases" (WSDM 2022)

Last update: Apr 13, 2022

Related tags

Overview

Retrieving Black-box Optimal Images from External Databases (WSDM 2022)

We propose how a user retreives an optimal image from external databases of web services (e.g., Flickr) with respect to user-defined functions (e.g., deep learning-based score functions.)

💿 Dependency

Please install

wget and unzip, e.g., by sudo apt install wget unzip,
PyTorch from the official website, and
other dependencies by pip install -r requirements.txt.

📂 Files

download.sh downloads and preprocesses the Open Image dataset.
environments.py implements wrappers of APIs, i.e., the oracles in the paper.
evaluate.py is the evaluation script.
methods.py implements Tiara, Tiara-S, and baseline methods.
openimage_feature_extract.py preprocess the Open Image dataset. Please run this script after you download images. This script is automatically run by download.sh.
preprocess_openimage.py preprocess the Open Image dataset. Please run this script before you download images. This script is automatically run by download.sh.
utils.py implements miscellaneous functions, i.e., the word embbeding loader.

🗃️ Download and Preprocess Datasets

$ bash ./download.sh

🧪 Evaluation

Try with Open Image datasets by

$ python evaluate.py --env open --verbose --num_seeds 1 -c 0

The results are saved in outputs directiory.

Please refer to the help command for further options.

$ python evaluate.py -h
usage: evaluate.py [-h] [--tuning] [--extra] [--env {open,flickr,flickrsim}]
                   [--num_seeds NUM_SEEDS] [--budget BUDGET]
                   [--api_key API_KEY] [--api_secret API_SECRET]
                   [--font_path FONT_PATH] [--verbose]
                   [-c [CLASSES [CLASSES ...]]]

optional arguments:
  -h, --help            show this help message and exit
  --tuning
  --extra
  --env {open,flickr,flickrsim}
  --num_seeds NUM_SEEDS
  --budget BUDGET
  --api_key API_KEY     API key for Flickr.
  --api_secret API_SECRET
                        API secret key for Flickr.
  --font_path FONT_PATH
                        Font path for wordclouds.
  --verbose
  -c [CLASSES [CLASSES ...]], --classes [CLASSES [CLASSES ...]]

Flickr API

The Flickr experiments require a Flickr API key. Please get a key from Flickr official website.

🖋️ Citation

@inproceedings{sato2022retrieving,
  author    = {Ryoma Sato},
  title     = {Retrieving Black-box Optimal Images from External Databases},
  booktitle = {Proceedings of the Fifteenth {ACM} International Conference on Web Search and Data Mining, {WSDM}},
  year      = {2022},
}

Code for "Retrieving Black-box Optimal Images from External Databases" (WSDM 2022)

Related tags

Overview

Retrieving Black-box Optimal Images from External Databases (WSDM 2022)

💿 Dependency

📂 Files

🗃️ Download and Preprocess Datasets

🧪 Evaluation

Flickr API

🖋️ Citation

Owner

joisino

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

An efficient implementation of GPNN

Python Auto-ML Package for Tabular Datasets

Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning

Website for D2C paper

Si Adek Keras is software VR dangerous object detection.

A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph generation to downstream tasks (e.g., image cpationing) is supported. Pytorch version implementation of HetH (ECCV 2020) and TopicSG (ICCV 2021) is included.

A two-stage U-Net for high-fidelity denoising of historical recordings

SSD-based Object Detection in PyTorch

Simple SN-GAN to generate CryptoPunks

Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers (arXiv2021)

tensorrt int8 量化yolov5 4.0 onnx模型

Fast, general, and tested differentiable structured prediction in PyTorch

Author Disambiguation using Knowledge Graph Embeddings with Literals

OpenMMLab Pose Estimation Toolbox and Benchmark.

Scripts and outputs related to the paper Prediction of Adverse Biological Effects of Chemicals Using Knowledge Graph Embeddings.

Spiking Neural Network for Computer Vision using SpikingJelly framework and Pytorch-Lightning

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

Multi-Stage Episodic Control for Strategic Exploration in Text Games