The method used is to visualize the blood and urine test results, to explore data analysis (EDA). Through exploratory analysis, it summarizes the method of data normalization and establishes a formula. First, blood and urine test results collect data of 5 patients with multiple myeloma. One of the data is the blood and urine test result data collected four times after the symptoms are caused. And comparison data collects blood and urine data in women in their 30s who do not have blood -related diseases. Categorize the terminal patient and stage 1 patient among the collected patient data. Normalization of data is normalized to the same digit. Based on the normalized data, the graph is written on the radar chart, and the graph with the normal patient is compared and analyzed to make meaningful discovery.
Data & Variables
The blood test data of the patient comes from the open data source of ‘Korea Federation of Multiple Myeloma Patients’. Reference values are from Konkuk University medical center. The condition of the collected patient is as follows. Blood data was collected from 5 patients who is diagnosed with multiple myeloma without treated with drugs or chemotherapy. One patient is the results of 3 tests for three months of blood and urine tests that patient who died within eight months of diagnosis. One is a patient who died within three months of diagnosis, and the other 3 patients are the data of blood and urine tests diagnosed in the first stage of MM. (Table 1) The parameter value of each patient is white blood cell(WBC), neutrophil SEG%, lymphocytes, monocytes, eosinophils, basophil, red blood cells(RBC), hemoglobin(Hb), hematocrit(Hct), MCV (Mean corpuscular volume), MCHC (Mean corpuscular hemoglobin concentration), MCH (Mean corpuscular hemoglobin), albumin, and platelet(PLT). As well as Creatine and Calcium values, creatine indicates the condition of the kidneys, and calcium(Ca) indicates the broken bones, were added as parameters. (Table 1)
Table 1 Patient’s parameter value list & reference value
Table 2 Patient list and conditions
No.
|
Patient list
|
Conditions
|
Symptoms
|
1
|
Patient 1
|
first blood & urine tests after leg pain occur
|
Y
|
2
|
Patient 1–1
|
Patient 1's second blood & urine tests result after chest pain occur
|
Y
|
3
|
Patient 1–2
|
Patient 1's third blood & urine tests result: Decreased after 8 months after the tests
|
Y
|
4
|
Patient 2
|
Decreased: diagnosis after 3 months
|
Y
|
5
|
Patient 6
|
Diagnosis as Stage 1, No symptoms but failed to collect stem cell
|
N
|
|
Patient 7
|
Diagnosis as Stage 1, No symptoms
|
N
|
6
|
Patient 8
|
Diagnosis as Stage 1, Bone broke
|
Y
|
7
|
Patient 10
|
Normal, No blood related diseases
|
|
Data Normalization
For accurate comparative analysis, it is necessary to normalize each patient’s parameter values of various parameters (1 to 100 units). The normalization of the data was applied according to the percentage method due to existence of international standard reference value for normal value of blood & urine. The conversion of percentage method is to visualize a graph round as possible for normal patient’s blood &urine test result. Also see what percentage of blood is composed of the maximum value based on the reference value. Explore patients' data before conversion, we find that most of patient’s values are lower than lowest reference value. distinctively at RBC parameter’s each patient’s value, 5 out of 7 patients are under lowest reference value, as well we Hct. Only Monocyte hit above highest reference value.(marketd in yellow) (Table 1) Therefore, normalizing data, we selected highest reference value to convert into a percentage. For the formula, P-value is each patient’s parameter value and R-value is reference value.
Formula:\(\text{X}=\frac{\text{P}-\text{v}\text{a}\text{l}\text{u}\text{e}}{\text{R}-\text{v}\text{a}\text{l}\text{u}\text{e}}\)