(KR) - [linux] valgrind를 이용하여 binary code를 profiling하기

Total Articles 494

Join
Login

[linux] valgrind를 이용하여 binary code를 profiling하기

gilgil

http://www.gilgil.net/?document_srl=1004687

2017.06.26 20:59:29 (*.1.1.1)

94109

N * N 배열이 있고 이 배열을 전부 0으로 채우는 함수가 있다고 가정한다. loop1()과 loop2()의 실행 속도에는 어떤 차이가 있을까?

static const int N = 4096;
static const int CNT = 10;

void loop1() {
static volatile char arr[N * N];
for (int cnt = 0; cnt < CNT; cnt++)
for (int a = 0; a < N; a++)
for (int b = 0; b < N; b++)
arr[a * N + b] = 0; // (A)
}

void loop2() {
static volatile char arr[N * N];
for (int cnt = 0; cnt < CNT; cnt++)
for (int a = 0; a < N; a++)
for (int b = 0; b < N; b++)
arr[b * N + a] = 0;// (B)
}

loop1()과 loop2()의 함수의 차이는 (A)와 (B) 밖에 없다. 얼핏 봐서는 성능의 차이가 없을 것으로 보이지만, 실제로 실행을 해 보면 많은 차이가 나게 된다. 허걱!!! (숫자의 단위는 nano second)

Screenshot from 2017-06-26 20-14-01.png

이런 차이가 나는 이유를 valgrind, kcachegrind를 이용하여 확인해 보도록 한다. 다음과 같은 명령어를 이용하여 valgrind와 kcachegrind를 설치한다(설치 용량이 100MB 조금 넘는다).

sudo apt install valgrind kcachegrind

실행 파일(loop_test)을 다음과 같이 실행하여 profile 파일을 생성시킨다.

valgrind --tool=callgrind ./loop_test

Screenshot from 2017-06-26 20-21-40.png

같은 폴더에 "callgrind.out.*"이라는 파일이 생성되었음을 확인할 수 있다.

Screenshot from 2017-06-26 20-22-21.png

설치된 kcachegrind 툴을 이용하여 profile 파일을 열어 본다. main함수 내에서 loop1()과 loop2()의 instruction code 실행 횟수가 일치함을 확인할 수 있다. 실제로 disassembled instruction code를 봐도 loop1()과 loop2()가 거의 동일하다.

Screenshot from 2017-06-26 20-27-31.png

이를 가지고서는 실제적으로 loop1()이 loop2()보다 빠르다는 것을 입증할 수는 없다. loop1()이 loop2()보다 빠른 이유는 실행되는 instruction code의 실행 횟수 차이가 아닌 CPU와 Memory간의 cache miss의 차이에 기인한다. 이를 위해서 다시 다음과 같은 명령어를 통하여 cache profiling을 해 보도록 한다.

valgrind –tool=cachegrind ./loop_test

Screenshot from 2017-06-26 20-33-45.png

생성된 "cachegrind.out.*" 파일을 kcachegrind 툴에서 열어 본다. loop1()과 loop2()가 Insturction Fetch에서는 동일한 횟수의 instruction code가 실행되어 지지만

Screenshot from 2017-06-26 20-36-11.png

L1 Data Write Miss에서는 loop1()보다 loop2()가 현저히 높음을 알 수 있다.

Screenshot from 2017-06-26 20-53-08.png

이와 같이 valgrind 관련 툴을 이용하면 binary code profiling을 통해 특정 코드의 실행에 있어서 instruction code의 실행 횟수와 cache miss등의 정보를 확인해 볼 수 있다.

[download]

loop_test.zip

You would..

List

No.	Subject	Author	Date	Views
Notice	컴퓨터 공부를 시작하는 분들을 위하여 19	gilgil	Oct 25, 2009	1812142
Notice	프리웨어 라이브러리 사용할 때의 예의에 대해서 5	gilgil	Feb 06, 2012	1363382
494	object의 type을 찾아 내는 방법(C++)	gilgil	Aug 31, 2018	101491
493	variable length array의 메모리 영역	gilgil	Aug 15, 2018	92875
492	프로그래밍은 대입 수학 문제 풀이가 아니다.	gilgil	Aug 06, 2018	87114
491	BoB와 중부대	gilgil	Jun 25, 2018	97196
490	교육 관련 수상 이력	gilgil	May 25, 2018	101054
489	추억의 게임 2	gilgil	May 19, 2018	101605
488	SNI checking과 SSL inspection 1	gilgil	May 09, 2018	107153
487	GPKI에서 발행(issue)한 인증서의 심각한 문제점 1	gilgil	Apr 05, 2018	102035
486	2MB18nomA	gilgil	Mar 24, 2018	95699
485	SnoopSpy 이야기	gilgil	Mar 18, 2018	95620
484	[nethunter] unauthorized 문제로 adb shell 진입이 안될 때	gilgil	Jan 10, 2018	245336
483	[linux] 간단현 명령어로 linux network 구성하기	gilgil	Dec 30, 2017	97039
482	[network] monitor mode에서 pcap_next_ex가 -1을 return하는 현상	gilgil	Nov 21, 2017	97141
481	[c++] function을 virtual로 선언해도 non-virtual call이 이루어 지는 경우	gilgil	Nov 15, 2017	100236
480	[tip] 질문하는 요령	gilgil	Oct 25, 2017	91242
479	[free] 삼성 CTF 본선 후기	gilgil	Aug 21, 2017	92193
478	[free] 삼성 CTF 예선 후기	gilgil	Jul 14, 2017	145702
477	[c++] qt programming	gilgil	Jun 29, 2017	94876
	[linux] valgrind를 이용하여 binary code를 profiling하기	gilgil	Jun 26, 2017	94109
475	[c++] Google logging library 사용 방법	gilgil	Jun 24, 2017	98147

List

Write

First Page 1 2 3 4 5 6 7 8 9 10 Last Page

[linux] valgrind를 이용하여 binary code를 profiling하기

Home

Downloads

Communities

Communities(KR)

Test