Data: (with Big Data and Database Semantics)
Research output: Contribution to journal › Journal article › Research › peer-review
Documents
- Data_big data_Rev2018_11_05
134 KB, Word document
It is argued that data should be defined as information on properties of units of analysis. Epistemologically it is important to establish that what is considered data by somebody need not be data for somebody else. This article considers the nature of data and “big data” and the relation between data, information, knowledge and documents. It is common for all these concepts that they are about phenomena produced in specific contexts for specific purposes and may be represented in documents, including as representations in databases. In that process, they are taken out of their original contexts and put into new ones and thereby data loses some of or all their meaning due to the principle of semantic holism. Some of this lost meaning should be reestablished in the databases and the representations of data/documents cannot be understood as a neutral activity, but as an activity supporting the overall goal implicit in establishing the database. Utilizing (big) data (as it is the case with utilizing information, knowledge and documents) demands first of all the identification of the potentials of these data for relevant purposes. The most fruitful theoretical frame for knowledge organization and data science is the social epistemology suggested by Shera (1951). One important aspect about big data is that they are often unintentional traces we leave during all kinds of activities. Their potential to inform somebody about something is therefore less direct as compared to data that have been produced intentionally as, for example, scientific databases.
Original language | English |
---|---|
Journal | Knowledge Organization |
Volume | 45 |
Issue number | 8 |
Pages (from-to) | 685-708 |
Number of pages | 24 |
ISSN | 0943-7444 |
DOIs | |
Publication status | Published - Dec 2018 |
Bibliographical note
Publikationen er også publiceret online: http://www.isko.org/cyclo/data
Number of downloads are based on statistics from Google Scholar and www.ku.dk
No data available
ID: 209653225