To construct a DataFrame more effectively
The old Python code looks like:

import pandas as pd

temp = pd.DataFrame()
for record in table:
    df = pd.DataFrame(record)
    temp = pd.concat([temp, df])

# The final result
result = temp

This snippet...
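The excerpt is cut off before the improved version. As a rough sketch of the faster pattern the title points at (assuming `table` is an iterable of per-row records, which is not spelled out in the excerpt), build all the pieces first and concatenate once:

import pandas as pd

# Hypothetical records, standing in for the `table` used in the post.
table = [{"a": [1], "b": [2]}, {"a": [3], "b": [4]}]

# Build the per-record frames first, then concatenate once;
# calling pd.concat inside the loop re-copies the accumulated data every iteration.
result = pd.concat([pd.DataFrame(record) for record in table], ignore_index=True)
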
Some tips about Argo Workflows (on Kubernetes)
While using Argo to execute workflows last week, I met some problems and also found the solutions. 1. Can’t parse “outputs”. By submitting this YAML file:

apiVersion: argoproj.io/v1alpha1
kind: Workflow...
Grab a hands-on realtime-object-detection tool
Trying to get a fast object-detection tool from GitHub (by fast I mean detection in less than 1 second on a mainstream CPU), I experimented with some repositories written in PyTorch (because I am familiar...
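The excerpt doesn't name the repositories it tried. As an illustration of the kind of CPU-latency check involved (the model choice, input size, and timing code below are my assumptions, not the post's), one might time a stock torchvision detector like this:

import time
import torch
from torchvision.models.detection import fasterrcnn_resnet50_fpn

model = fasterrcnn_resnet50_fpn(pretrained=True)
model.eval()

image = torch.rand(3, 480, 640)  # a dummy frame; a real test would use camera frames

with torch.no_grad():
    start = time.perf_counter()
    predictions = model([image])
    print(f"inference took {time.perf_counter() - start:.2f}s on CPU")
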
Be careful of the ternary operator in Python
from pathlib import Path

date = "yes"
my_path = Path("hello") / date if date else "no" / "last"
print(my_path)

The result will be: hello/yes. Where did the "last" go? It goes with the "no". The Python interpreter...
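A short sketch of the precedence issue (my own illustration, not taken from the post): the conditional expression binds more loosely than /, so explicit parentheses show the two possible groupings.

from pathlib import Path

date = "yes"

# How Python actually parses the original line:
implicit = (Path("hello") / date) if date else ("no" / "last")
print(implicit)  # hello/yes -- the else branch never runs here

# If the intent was hello/<date or "no">/last, parenthesize the conditional:
explicit = Path("hello") / (date if date else "no") / "last"
print(explicit)  # hello/yes/last
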
Image pull policy in Kubernetes
Recently, we have been using Kubernetes for our project. Yesterday, a problem haunted me severely: even though I had pushed the Docker image to GCR (Google Container Registry), the pod in Kubernetes would still use...
Use both ‘withParam’ and ‘when’ in Argo Workflows (on Kubernetes)
In Argo, we can use ‘withParam’ to create loop logic:

- - name: generate
    template: gen-number-list
# Iterate over the list of numbers generated by the generate step above
- - name: sleep
    template:...
Problems about using treelite in Kubernetes
treelite is an easy-to-use tool to accelerate the prediction speed of XGBoost/LightGBM models. Three days ago, I tested it in my 4-CPU-core virtual machine and found out that it could reduce the running...
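For context, a minimal sketch of the usual treelite flow (based on the treelite 1.x API, which has changed across releases; the model file, toolchain, and parameters here are placeholders, not the post's setup):

import numpy as np
import treelite
import treelite_runtime

# Compile a saved LightGBM model file into a shared library (treelite 1.x API).
model = treelite.Model.load("lightgbm_model.txt", model_format="lightgbm")
model.export_lib(toolchain="gcc", libpath="./predictor.so",
                 params={"parallel_comp": 4}, verbose=True)

# Load the compiled library and run a prediction on some dummy features.
predictor = treelite_runtime.Predictor("./predictor.so")
X = np.random.rand(1000, 32).astype(np.float32)
predictor.predict(treelite_runtime.DMatrix(X))
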
Recently learned tips about NumPy and Pandas
Precision. After running this snippet:

import numpy as np

a = np.array([0.112233445566778899], dtype=np.float32)
b = np.array([0.112233445566778899], dtype=np.float64)
print(a, b)

It prints...
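The excerpt stops before the output. The underlying point (a property of the dtypes, not something quoted from the post) is that float32 keeps only about 7 significant decimal digits while float64 keeps about 16, which NumPy can report directly:

import numpy as np

# Decimal precision and machine epsilon for the two dtypes.
print(np.finfo(np.float32).precision, np.finfo(np.float32).eps)  # 6   ~1.19e-07
print(np.finfo(np.float64).precision, np.finfo(np.float64).eps)  # 15  ~2.22e-16
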
Use `psql` to download data as CSV file
Although SQL Workbench is a handy tool for querying AWS Redshift, we still need a CLI tool for automation. To install psql on macOS, we need to run:

brew install postgresql

Then we could download data...
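The excerpt cuts off before the download step. As an illustration only (the connection string, table, and file names below are made up), psql's \copy meta-command can write query results to a local CSV, and it is easy to drive from Python for automation:

import subprocess

# Placeholder Redshift connection info; the password usually comes from
# the PGPASSWORD environment variable or ~/.pgpass.
conninfo = "host=my-cluster.example.redshift.amazonaws.com port=5439 dbname=dev user=awsuser"

# \copy runs the query server-side but writes the CSV on the local machine.
command = r"\copy (SELECT * FROM my_table LIMIT 1000) TO 'my_table.csv' WITH CSV HEADER"

subprocess.run(["psql", conninfo, "-c", command], check=True)
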
The nn.Sigmoid() of PyTorch on Android device
Using PyTorch, I have trained an EfficientNet model to classify more than ten thousand different categories of birds. To run this model on a mobile device, I built a program by learning the...
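The excerpt stops before the details. Below is a rough sketch, not the post's code, of how a classifier ending in nn.Sigmoid() is typically exported for PyTorch Mobile on Android; the torchvision backbone, input size, and file name are all assumptions:

import torch
import torch.nn as nn
from torchvision.models import efficientnet_b0
from torch.utils.mobile_optimizer import optimize_for_mobile

# Stand-in backbone; the actual model is a trained EfficientNet with >10,000 bird classes.
backbone = efficientnet_b0(num_classes=10000)
model = nn.Sequential(backbone, nn.Sigmoid())  # nn.Sigmoid() turns raw logits into scores
model.eval()

example = torch.rand(1, 3, 224, 224)
traced = torch.jit.trace(model, example)
optimized = optimize_for_mobile(traced)
optimized._save_for_lite_interpreter("birds.ptl")  # this file is loaded by the Android app
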
Some tips about BigQuery on GCP
Migrate SQL script from AWS Redshift to BigQuery: CONVERT_TIMEZONE('AEDT', getdate())::DATE in Redshift should be changed to current_date("Australia/Sydney") in BigQuery. Since BigQuery doesn’t force type...
A problem about running Argo
After I launched an Argo workflow, it just hung at the ContainerCreating stage. After waiting for more than 10 minutes, it hadn’t changed at all. Then I found this article. After using kubectl describe...
Failed to establish pod watch in Argo
After creating a brand new Kubernetes cluster in GKE, I launched an Argo workflow but saw these errors. Argo will create two containers for a step: a ‘main’ container and a ‘wait’ container. But why the...
Experiments on Bayesian Optimization
Bayesian Optimization is a popular search algorithm for hyper-parameters in the machine learning area. There are two popular Python libraries for this algorithm: Hyperopt and Optuna. So I have...
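For readers who haven't used either library, here is a minimal Optuna example of the kind of search being compared; the objective below is a toy function I made up, not the post's experiment:

import optuna

def objective(trial):
    # Toy 1-D objective; a real experiment would train and score a model here.
    x = trial.suggest_float("x", -10.0, 10.0)
    return (x - 2.0) ** 2

study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=50)
print(study.best_params)
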
First trial for PyCaret
The first time I noticed PyCaret was on the recommendation page of Google Chrome. In recent days I got time to test it. The original test program failed because there is a column which has a number...
The first trial for PyCaret
The code for using PyCaret is quite simple:

import pandas as pd
from pycaret.classification import setup, compare_models  # or pycaret.regression, depending on the task

df = pd.read_csv(TRAIN_CSV_FILE)
setup(data=df, target="TARGET", session_id=1023)
compare_models(verbose=False)

But it reported an error in the first run:...
Submit Argo workflow to different clusters
A couple of days ago I was looking for a tool to manage different Kubernetes clusters from my laptop. But after a while, I realized that kubectl actually supports multiple clusters by itself (link)....
Using GPU for LightGBM
One of my team members had done some tests on using a GPU for LightGBM training. The result is quite good: the GPU could make training about 2 times faster. But this also raises my...
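For reference, this is roughly what enabling the GPU looks like (a sketch with random toy data, not the team's actual benchmark; it requires a LightGBM build compiled with GPU support):

import lightgbm as lgb
import numpy as np

# Toy data just to make the sketch runnable.
X = np.random.rand(10000, 50)
y = np.random.randint(0, 2, size=10000)
train_set = lgb.Dataset(X, label=y)

params = {
    "objective": "binary",
    "device_type": "gpu",   # falls back with an error if LightGBM lacks GPU support
    "gpu_platform_id": 0,   # optionally pin the OpenCL platform/device explicitly
    "gpu_device_id": 0,
}
booster = lgb.train(params, train_set, num_boost_round=100)
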
Using loop in Jsonnet
Jsonnet is a templating language and tool for generating JSON/YAML files. Since we now have a language instead of static configuration, we can generate a bunch of configurations with simple code. For...
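To keep the illustration in Python, here is a small sketch (my own example, not the post's) that evaluates a Jsonnet list comprehension through the official jsonnet Python bindings; the field names and environments are made up:

import json
import _jsonnet  # pip install jsonnet

# A loop in Jsonnet: one config object per environment, built with a list comprehension.
snippet = """
[
  { name: "service-" + env, replicas: if env == "prod" then 3 else 1 }
  for env in ["dev", "staging", "prod"]
]
"""
configs = json.loads(_jsonnet.evaluate_snippet("loop.jsonnet", snippet))
print(configs)
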