For more detail, you can download the HTML file here for free.
These notes follow that course.
Raw Data
Processed Data:
Data processing is actually part of data analysis; in fact, a huge component of a data scientist's job is performing those sorts of processing operations. The raw data may only need to be processed once, but regardless of how often you process it, you need to keep a record of everything you did, because it can have a major impact on the downstream analysis.
Raw Data:
Tidy Data:
Other important tips about tidy data:
Four things you should have from raw data to tidy data:
Code book:
Other important tips about the code book:
Get/set your working directory:
getwd() and setwd()
Relative paths: setwd("./data") or setwd("../")
Absolute paths: setwd("/Users/jtleek/data/")
On Windows: setwd("C:/Users/Andrew/Downloads") or setwd("C:\\Users\\Andrew\\Downloads")
Checking for and creating directories:
file.exists("directoryName") will check to see if the directory exists.
dir.create("directoryName") will create a directory if it does not exist.
Example:
if(!file.exists("data")){
dir.create("data")
}
Getting data from the internet–download.file()
Example:
# data from https://data.baltimorecity.gov/Transportation/Baltimore-Fixed-Speed-Cameras/dz54-2aru
fileUrl <- "https://data.baltimorecity.gov/api/views/dz54-2aru/rows.csv?accessType=DOWNLOAD"
download.file(fileUrl, destfile = "./data/camera.csv")
list.files("./data")
On Linux/Mac, the second line should be a little different:
download.file(fileUrl, destfile = "./data/camera.csv" , method = "curl" )
An important consideration when downloading files from the internet is that those files might change. For example, if they change the cameras, there might be a new set of cameras and the data we are analyzing might be different, so record when you downloaded:
dateDownloaded <- date()
dateDownloaded
Some notes about download.file():
If the URL starts with http, you can use download.file().
If the URL starts with https, on Mac you may need to set method = "curl".
Loading flat files–read.table():
read.table() is the main function for loading flat files into R. It reads the data into RAM, so very large files can cause problems. Related functions are read.csv() and read.csv2().
Example:
If we use
cameraData <- read.table("./data/camera.csv")
we will get an error:
Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, : line 1 did not have 13 elements
The reason is that camera.csv is comma-separated, but the default for read.table() is to look for a tab-delimited file. There are two ways you can read this data:
cameraData <- read.table("./data/camera.csv", sep = ",", header = TRUE)
head(cameraData,3)
or, since read.csv() automatically sets sep = "," and header = TRUE:
cameraData <- read.csv("./data/camera.csv")
head(cameraData, 3)
Some more important parameters:
quote = "" means no quotes.
nrows = 10 reads 10 lines.
In my experience, the biggest trouble with reading flat files is quotation marks ' or " placed in data values; setting quote = "" often resolves these problems.
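As a minimal sketch of that quote fix (using a small hypothetical file, quoted.txt, written on the fly):

```r
# Write a small tab-delimited file with a stray apostrophe in one value
writeLines(c("id\tname", "1\tO'Brien", "2\tSmith"), "quoted.txt")

# With the default quoting, the apostrophe opens a quote that never closes
# and the rows get mangled; quote = "" treats it as ordinary text
df <- read.table("quoted.txt", header = TRUE, sep = "\t",
                 quote = "", stringsAsFactors = FALSE)
df$name  # "O'Brien" "Smith"
```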
Loading Excel files–read.xlsx():
Excel files are still probably the most widely used format for sharing data.
Example:
# Download the file to load
fileUrl <- "https://data.baltimorecity.gov/api/views/dz54-2aru/rows.xlsx?accessType=DOWNLOAD"
download.file(fileUrl, destfile = "./data/camera.xlsx", mode="wb")
dateDownloaded <- date()
# install xlsx package first
## install.packages("xlsx")
# load package and read excel data
library(xlsx)
cameraData <- read.xlsx("./data/camera.xlsx", sheetIndex = 1, header = TRUE)
head(cameraData)
On Linux/Mac, add method = "curl" to the download.file() call and delete mode = "wb".
mode = "wb" is very important on Windows because without it there will be an error:
Error in .jcall("RJavaTools", "Ljava/lang/Object;", "invokeMethod", cl, : java.util.zip.ZipException: invalid distance too far back
Reading specific rows and columns:
colIndex <- 2:3
rowIndex <- 1:4
cameraDataSubset <- read.xlsx("./data/camera.xlsx", sheetIndex = 1, colIndex = colIndex, rowIndex = rowIndex)
cameraDataSubset
Further notes:
write.xlsx() will write out an Excel file with similar arguments.
read.xlsx2() is much faster than read.xlsx(), but may be unstable for reading subsets of rows.
The XLConnect package has more options for writing and manipulating Excel files.
XML:
Tags, elements and attributes:
Tags correspond to general labels: start tags like <section>, end tags like </section>, and empty tags like <line-break />.
Elements are specific examples of tags, for example <Greeting> Hello, world </Greeting>.
Attributes are components of the label, for example <step number="3"> Connect A to B. </step>.
Read XML file into R
# install XML package first
## install.packages("XML")
library(XML)
fileUrl <- "http://www.w3schools.com/xml/simple.xml"
doc <- xmlTreeParse(fileUrl, useInternal = TRUE)
rootNode <- xmlRoot(doc)
xmlName(rootNode)
names(rootNode)
xmlTreeParse() parses the XML file: it loads the document into memory in R, where it is still a structured object, so we use different functions to access its different parts.
xmlRoot() gives access to the root element of that XML file.
xmlName() gives the name of the root node.
names() gives the names of all the elements nested within the root node.
The next thing you can do is directly access parts of the XML document, in much the same way you access a list in R:
# first element
rootNode[[1]]
# first element of the first element
rootNode[[1]][[1]]
# extract different parts of the file programmatically
xmlSApply(rootNode, xmlValue)
With xmlSApply(), you pass a parsed XML object and the function you would like to apply; it loops through all the elements of the XML root node and applies that function to each. Here xmlValue extracts text content: some types of XML nodes have no child nodes but are leaf nodes that simply contain text.
XPath: a new language
/node: top level node
//node: node at any level
node[@attr-name]: node with an attribute named attr-name
node[@attr-name="bob"]: node with attribute attr-name equal to "bob"
Get the items on the menu and their prices:
# extract content by elements
xpathSApply(rootNode, "//name", xmlValue)
xpathSApply(rootNode, "//price", xmlValue)
Extract content by attributes
# Extract content by attributes
fileUrl <- "http://espn.go.com/nfl/team/_/name/bal/baltimore-ravens"
doc <- htmlTreeParse(fileUrl, useInternal = TRUE)
# find li elements with `class = "team-name"` and return their value
teams <- xpathSApply(doc, "//li[@class='team-name']", xmlValue)
teams
JSON:
Reading data from JSON (jsonlite package)
# install package first
## install.packages("jsonlite")
library(jsonlite)
# what you get from fromJSON function is a structured data frame
jsonData <- fromJSON("https://api.github.com/users/jtleek/repos")
# all names of this data frame
names(jsonData)
# look at the names of that particular variable
names(jsonData$owner)
jsonData$owner$login
How to convert a data frame to JSON
# writing data frame to JSON
myjson <- toJSON(iris, pretty = TRUE)
# print it out: too long, you can view it yourself
#cat(myjson, nrow = 2)
# fromJSON return data frame again
iris2 <- fromJSON(myjson)
head(iris2, 3)
data.table:
Create data tables just like data frames
# install package first
## install.packages("data.table")
library(data.table)
# data frame
DF <- data.frame(x = rnorm(9), y = rep(letters[1:3], each = 3), z = rnorm(9))
head(DF, 3)
# data table
DT <- data.table(x = rnorm(9), y = rep(letters[1:3], each = 3), z = rnorm(9))
head(DT, 3)
See all the data tables in memory:
tables()
Subsetting rows
DT[2,]
DT[DT$y=="a",]
DT[c(2,3)]
# or
DT[c(2,3),]
Subsetting columns:
Subsetting columns is where data tables and data frames really diverge. A data table does not subset columns with the same subsetting rules a data frame uses; instead it takes expressions, which can be used to summarize the data in various different ways.
# expression in R
k <- {print(10); 5}
print(k)
Calculating values for variables with expressions
DT[, list(mean(x), mean(z))]
DT[,table(y)]
Another thing data.table does very fast and memory-efficiently is adding a new column. Usually, when you add a new variable to a data frame, R copies over the entire data frame, so you end up with two copies in memory; with big data sets this causes memory problems. A data table adds the column in place without creating a new copy, so if you do want a copy you have to make one explicitly with the copy() function.
DT[, w:=z^2]
DT2 <- DT
DT[, y:=2]
head(DT, 2)
head(DT2, 2)
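The assignment DT2 <- DT above only creates a second reference to the same table, which is why DT2 picks up the new y. To get a true independent copy, use copy(); a minimal sketch:

```r
library(data.table)

DT <- data.table(x = 1:3, y = c("a", "b", "c"))

# copy() allocates a genuinely new table rather than another reference
DT3 <- copy(DT)

# modifying DT by reference no longer touches DT3
DT[, y := "z"]
DT3$y  # still "a" "b" "c"
```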
Multiple operations
DT[, m:={tmp <- {x+z}; log2(tmp+5)}]
plyr-like operations
DT[,a:=x>0]
DT[,b:=mean(x+w),by=a]
Special variable .N:
An integer, length 1, containing the number of rows in the current group.
set.seed(123)
DT <- data.table(x = sample(letters[1:3], 1E5, TRUE))
# number of appearances of a, b, c
DT[, .N,by=x]
Keys:
A unique aspect of data tables is that they have keys; if you set a key, it's possible to subset and sort a data table much more rapidly than you can with a data frame.
DT <- data.table(x = rep(letters[1:3], each = 100), y = rnorm(100))
# set keys
setkey(DT, x)
# subsetting rows with x == "a"
head(DT["a"])
Joins or merge data table using keys
DT1 <- data.table(x = c("a", "a", "b", "dt1"), y = 1:4)
DT2 <- data.table(x = c("a", "b", "dt2"), z = 5:7)
# set keys
setkey(DT1, x)
setkey(DT2, x)
# join
merge(DT1, DT2)
Fast reading
big_df <- data.frame(x = rnorm(1E6), y = rnorm(1E6))
file <- tempfile()
write.table(big_df, file = file, row.names = FALSE, col.names = TRUE, sep = "\t", quote = FALSE)
# the fread command can be used to read data tables much faster
system.time(fread(file))
system.time(read.table(file, head = TRUE, sep = "\t")) # about 10 times slower